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TITLE OF THE INVENTION 

DNA MOLECULES ENCODING HUMAN NHL, A DNA HELICASE 

10 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit, under 35 U.S.C. §1 19(e), of U.S. 
provisional application 60/169,970 filed December 9, 1999. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
15 Not Applicable 

REFERENCE TO MICROFICHE APPENDIX 
Not Applicable 

20 

FIELD OF THE INVENTION 

The present invention relates in part to isolated nucleic acid molecules 
(polynucleotides) which encode NHL, a putative DNA helicase. The present 
invention also relates to recombinant vectors and recombinant hosts which contain a 

25 DNA fragment encoding NHL, substantially purified forms of associated NHL, 
associated mutant proteins, and methods associated with identifying compounds 
which modulate NHL, which will be useful in the treatment of various neoplastic 
disorders, given that this gene is located at 20ql3.3 and immediately adjacent to 
M68/DcR3, which is involved in tumor growth. Also included within the present 

30 invention is a human genomic fragment representing this portion of the human 
genome, along with three additional genes (M68/DcR3, SCLIP, and ARP). 
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BACKGROUND OF THE INVENTION 

Naumovski et al. (1985, Mol. Cell Biol. 5:17-26; Reynolds et al. (1985 Nucleic 
5 Acid Res 13:2357-2372) and Weber et al. (1990 EMBO J. 9: 1437-1447) disclose 
members of the RAD3/ERCC2 gene family of DNA helicases. 

It is known that several chemotherapeutic agents inhibit helicases, including 
actinomycin CI, daunorubicin and nogalamycin (Tuteja, et al M 1997, Biochem. 
Biophys. Res. Comm. 236(3):636-640), and a prostate cancer drug, CI-958 (Lun, et 
10 al.,1998, Cancer Chemother. Pharmacol. 42(6):447-453). In addition, some 
topoisomerases have been shown to have anti-cancer activity. 

Despite the identification of the aforementioned helicase-encoding genes and 
chemotherapeutic agents, it would be advantageous to identify additional genes which 
reside within chromosomal regions associated with a disease state such as cancer as 
15 well as a gene which encodes a type of protein which may be associated with that 
disease. The present invention addresses and meets this need by disclosing a DNA 
molecule encoding a DNA helicase with a chromosomal location suggestive of 
association with cancer. 



20 SUMMARY OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
(polynucleotide) which encodes a novel mammalian DNA helicase. 

The present invention also relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
25 helicase, NHL. 

A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure 1A-B and SEQ ID NO:L 

The present invention also relates to biologically active fragments or mutants 
30 of SEQ ID NO: 1 which encode a mRNA molecule expressing a novel DNA helicase, 
NHL. Any such biologically active fragment and/or mutant will encode either a 
protein or protein fragment which at least substantially mimics the biological 
properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 
SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to 



-2- 



WO 01/42434 



PCT/US00/33065 



nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy- 
terminal truncations such that these mutations encode mRNA which express a 
functional NHL protein in a host cell, so as to be useful for screening for agonists 
and/or antagonists of NHL activity. 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which contain the substantially purified 
nucleic acid molecules disclosed throughout this specification. 

The present invention also relates to a substantially purified form of a human 
NHL protein which comprises the amino acid sequence disclosed in Figure 2 and set 
10 forth as SEQ ID NO:2. 

A preferred aspect of this portion of the present invention is a NHL protein 
which consists of the amino acid sequence disclosed in Figure 2 and set forth as SEQ 
E)NO:2. 

Another preferred aspect of the present invention relates to a substantially 
15 purified NHL protein, preferably a human NHL protein, obtained from a recombinant 
host cell containing a DNA expression vector comprises a nucleotide sequence as set 
forth in SEQ ID NO: 1 and expresses the respective NHL protein. It is especially 
preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

20 The present invention also relates to biologically active fragments and/or 

mutants of a NHL protein comprising the amino acid sequence as set forth in SEQ ID 
NO:2, including but not necessarily limited to amino acid substitutions, deletions, 
additions, amino terminal truncations and carboxy-terminal truncations such that these 
mutations provide for proteins or protein fragments of diagnostic, therapeutic or 

25 prophylactic use and would be useful for screening for selective modulators, including 
but not limited to agonists and/or antagonists for human NHL pharmacology. 

A preferred aspect of the present invention is disclosed in Figure 2 and is set 
forth as SEQ ID NO:2, a respective amino acid sequence which encodes human NHL. 
Characterization of one or more of these DNA helicase-Iike proteins allows for 

30 screening methods to identify novel NHL modulators that may be useful in the 
treatment of human neoplastic disorders. The modulators selected through such 
screening and selection protocols may be used alone or in conjunction with other 
cancer therapies. As noted above, heterologous expression of a NHL protein will 
allow the pharmacological analysis of compounds which modulate NHL activity and 
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hence may be useful in various cancer therapies. To this end, heterologous cell lines 
expressing a NHL protein can be used to establish functional or binding assays to 
identify novel NHL modulators. 

The present invention also relates to polyclonal and monoclonal antibodies 
raised in response to either the NHL or a biologically active fragment of NHL. 

The present invention relates to transgenic mice comprising altered genotypes 
and phenotypes in relation to NHL and its in vivo activity. 

The present invention also relates to NHL fusion constructs, including but not 
limited to fusion constructs which express a portion of the NHL protein linked to 
various markers, including but in no way limited to GFP (Green fluorescent protein), 
the MYC epitope, and GST. Any such fusion constructs may be expressed in the cell 
line of interest and used to screen for NHL modulators. 

Therefore, the present invention relates to methods of expressing mammalian 
NHL, and preferably human NHL, biological equivalents disclosed herein, assays 
employing these gene products, recombinant host cells which comprise DNA 
constructs which express these proteins, and compounds identified through these 
assays which act as agonists or antagonists of NHL activity. 

The present invention also relates to the isolated genomic sequence which 
comprises SEQ ID NO: 1, a 1 15 kb genomic fragment set forth herein as SEQ ED 
NO:3. As especially preferred aspect of this portion of the invention is the region of 
the genomic fragment of SEQ ID NO:3 which comprises the regulatory and coding 
regions of human NHL, as well as intervening sequences (introns). This 115 kb 
fragment contains at least the coding region of four genes, NHL, M68/DcR3, SCLIP 
and ARP. As discussed herein, it has been shown that this region of chromosome 20 
is associated with tumor growth. Therefore, an aspect of this invention also comprises 
the use of one or more regions of this 1 15 kb genomic sequence to identify 
compounds which up or downregulate expression of one or more of the genes 
localized within this 1 15 kb region, wherein this up or down regulation results in an 
interference of tumor growth. For example, a transcription element of one of these 
four genes may be responsible for M68/DcR3 ( and/or NHL) overexpression in 
tumors, and if M68 or NHL overexpression in tumors has a caustic role, blockage of 
M68/DcR3 or NHL overexpression in tumors by interfering with this transcription site 
will be useful. 
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It is an object of the present invention to provide an isolated nucleic acid 
molecule (e.g., SEQ ID NO:l) which encodes novel form of human NHL, or 
fragments, mutants or derivatives of human NHL as set forth in Figure 2 and SEQ ID 
NO:2. Any such polynucleotide includes but is not necessarily limited to nucleotide 
5 substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal 
truncations such that these mutations encode mRNA which express a protein or 
protein fragment of diagnostic, therapeutic or prophylactic use and would be useful 
for screening for selective modulators of human NHL activity. 

It is a further object of the present invention to provide the mammalian, and 
10 especially human, NHL proteins or protein fragments encoded by the nucleic acid 
molecules referred to in the preceding paragraph. 

It is a further object of the present invention to provide recombinant vectors 
and recombinant host cells which comprise a nucleic acid sequence encoding 
mammalian, and especially human, NHL protein and biological equivalent thereof. 
15 It is an object of the present invention to provide a substantially purified form 

of human NHL, as set forth in Figure 2 and SEQ ID NO:2. 

Is another object of the present invention to provide a substantially purified 
recombinant form of a NHL protein which has been obtained from a recombinant host 
cell transformed or transfected with a DNA expression vector which comprises and 
20 appropriately expresses a complete open reading frame as set forth in SEQ ED NO: 1, 
resulting in a functional, processed form of NHL. It is especially preferred is that the 
recombinant host cell be a eukaryotic host cell, such as a mammalian cell line. 

It is an object of the present invention to provide for biologically active 
fragments and/or mutants of mammalian, and especially human, NHL, such as set 
25 forth in SEQ ID NO:2, including but not necessarily limited to amino acid 

substitutions, deletions, additions, amino terminal truncations and carboxy-terminal 
truncations such that these mutations provide for proteins or protein fragments of 
diagnostic, therapeutic and/or prophylactic use. 

It is also an object of the present invention to use NHL proteins or biological 
30 equivalent to screen for modulators, preferably selective modulators, of human NHL 
activity. Any such compound may be useful in screening for and selecting compounds 
active against human neoplastic disorders. 

As used herein, "substantially free from other nucleic acids" means at least 
90%, preferably 95%, more preferably 99%, and even more preferably 99.9%, free of 
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other nucleic acids. Thus, a human NHL DNA preparation that is substantially free 
from other nucleic acids will contain, as a percent of its total nucleic acid, no more 
than 10%, preferably no more than 5%, more preferably no more than 1%, and even 
more preferably no more than 0.1%, of non-NHL nucleic acids. Whether a given 

5 NHL DNA preparation is substantially free from other nucleic acids can be 

determined by such conventional techniques of assessing nucleic acid purity as, e.g., 
agarose gel electrophoresis combined with appropriate staining methods, e.g., 
ethidium bromide staining, or by sequencing. 

As used herein, "substantially free from other proteins" or "substantially 

10 purified" means at least 90%, preferably 95%, more preferably 99%, and even more 
preferably 99.9%, free of other proteins. Thus, a NHL protein preparation that is 
substantially free from other proteins will contain, as a percent of its total protein, no 
more than 10%, preferably no more than 5%, more preferably no more than 1%, and 
even more preferably no more than 0.1%, of non-NHL proteins. Whether a given 

15 NHL protein preparation is substantially free from other proteins can be determined 
by such conventional techniques of assessing protein purity as, e.g., sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE) combined with appropriate 
detection methods, e.g., silver staining or immunoblotting. As used interchangeably 
with the terms "substantially free from other proteins" or "substantially purified", the 

20 terms "isolated NHL protein" or "purified NHL protein" also refer to NHL protein that 
has been isolated from a natural source. Use of the term "isolated" or "purified" 
indicates that NHL protein has been removed from its normal cellular environment. 
Thus, an isolated NHL protein may be in a cell-free solution or placed in a different 
cellular environment from that in which it occurs naturally. The term isolated does 

25 not imply that an isolated NHL protein is the only protein present, but instead means 
that an isolated NHL protein is substantially free of other proteins and non-amino acid 
material (e.g., nucleic acids, lipids, carbohydrates) naturally associated with the NHL 
protein in vivo. Thus, a NHL protein that is recombinantly expressed in a prokaryotic 
or eukaryotic cell and substantially purified from this host cell which does not 

30 naturally (i.e., without intervention) express this protein is of course "isolated NHL 
protein" under any circumstances referred to herein. As noted above, a NHL protein 
preparation that is an isolated or purified NHL protein will be substantially free from 
other proteins will contain, as a percent of its total protein, no more than 10%, 
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preferably no more than 5%, more preferably no more than 1%, and even more 
preferably no more than 0.1%, of non-NHL proteins. 

As used interchangeably herein, "functional equivalent" or "biologically active 
equivalent" means a protein which does not have exactly the same amino acid 

5 sequence as naturally occurring NHL, due to alternative splicing, deletions, mutations, 
substitutions, or additions, but retains substantially the same biological activity as 
NHL. Such functional equivalents will have significant amino acid sequence identity 
with naturally occurring NHL and genes and cDNA encoding such functional 
equivalents can be detected by reduced stringency hybridization with a DNA sequence 

10 encoding naturally occurring NHL. For example, a naturally occurring NHL disclosed 
herein comprises the amino acid sequence shown as SEQ ID NO:2 and is encoded by 
SEQ ID NO:l. A nucleic acid encoding a functional equivalent has at least about 
50% identity at the nucleotide level to SEQ ID NO:l. 

As used herein, "a conservative amino acid substitution" refers to the 

15 replacement of one amino acid residue by another, chemically similar, amino acid 
residue. Examples of such conservative substitutions are: substitution of one 
hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., 
arginine for lysine; glutamic acid for aspartic acid). 

20 As used herein, the term "mammalian" will refer to any mammal, including a 

human being. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A-B shows the nucleotide sequence which comprises the open reading 
25 frame which encodes human NHL, the nucleotide sequence set forth as SEQ ID NO:l. 
The initiating Met residue (ATG) and the stop codon (TAG) are underlined. 

Figure 2 shows the amino acid sequence of human NHL as set forth in SEQ ID 

NO:2. 

Figure 3 shows the alignment of amino acid sequences of human NHL to 
30 ERCC2/RAD3 gene family members. Rep D (Dictyosteliem discoideum); RAD 3 
(5. cerevisiae); RAD 15 (5. pombe) and XP_GroupD (Homo sapiert). 

Figure 4 shows Northern analysis of NHL expression in multi-human tissues. 

Figure 5 A-B show the genomic structure of the NHL gene (Figure 5 A) and the 
entire 1 15 kb genomic region (Figure 5B) containing the NHL, M68/DcR3, SCLIP 
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and ARP genes. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
5 (polynucleotide) which encodes a novel mammalian DNA helicase. An especially 
preferred aspect of this invention relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
helicase, NHL. 

The gene M68/DcR3 is a secreted TNFR member that is overexpressed in a 

10 number of human tumors. M68/DcR3 is located at 20ql3.3, a known site that is 
associated with frequent gene amplification in cancer. M68/DcR3 protein binds to 
FASL and inhibit FAS mediated apoptosis. Thus, genes tightly linked to M68/DcR3 
may be coregulated (e.g. co overexpressed and/or amplified in tumors). During the 
course of cloning the genomic M68/DcR3 fragment and identifying genes that are 

15 linked to M68/DcR3 at 20ql3.3, three genes, including a novel gene that is similar to 
the Rad3/ERCC2 helicase family, were identified (termed NHL) in the immediately 
adjacent (overlapping) region. Given NHL's chromosomal location and the frequent 
association of DNA helicases with human genetic disorders (mutations in DNA 
helicases have been found associated with multiple diseases, including xeroderma 

20 pigmentosum, Cockayne's syndrome, Bloom's syndrome, and Werner's syndrome), 
NHL is a candidate for contribution to certain human neoplastic disorders. To this 
end, the genomic clone for this gene is disclosed and the complete sequence is 
determined. The transcript was identified through exon prediction using GRAIL2 and 
sequence alignment to a contiguous 4.5 kilobase region of chromosome 4 (88% 

25 sequence identity). The complete exon structure of NHL was subsequently confirmed 
by RT-PCR analysis. Multiple sequence alignment of NHL to known helicases 
showed that NHL contains all the seven critical helicase domains. BLAST analysis of 
the predicted 1,219 amino acid sequence revealed an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 

30 helicases (Naumovski et al., 1985 Mol Cell Biol 5:17-26; Reynolds et al., 1985 
Nucleic Acid Res 13:2357-72; Weber et al., 1990 EMBO J. 9:1437-1447). The 
mRNA expression pattern of NHL was also examined in multiple human tissues. 
Radiation hybrid chromosomal mapping reconfirms that it is linked to M68/DcR3 
locus. 
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A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure 1A-B and SEQ ID NO:l, which is as follows: 





AGTCAGCCCT 


GCTGCCAGCC 


AGTGCCGGGT 


GCTGGGGACT 


CAGGGAGGCC 


CGCCGGGACC 


5 


ACTGCGGGAC 


AGTGAGCCGA 


GCAGAAGCTG 


GAACGCAGGA 


GAGGAAGGAG 


AGGGGGCGGT 




CAGGGCTCTC 


AGGAGCCGGG 


TCCTGGGCAA 


GGCGCAGCCG 


TTTTCAAATT 


TTCAGGAAAG 




CGGTCGGCTC 


ACACTCGAGC 


AGTAAAAAGA 


TGCCTCTGGG 


GAGGAGGCCC 


GTGCAGCTCT 




CCGGGCAATG 


GTGGTGGCTC 


GGCCTAGAGA 


GGCGGTAGTG 


GAACGCAGAC 


CCTGGTGGGG 




GAATGACATC 


AAGGGAGGAG 


ACGGGCGGGA 


CCCCAGATTT 


CTGCCTGTGG 


GCGATGGAAG 


10 


TGAGGTTCAC 


TGGCCAGCGG 


AGCCGGACAC 


AGAACGCGCA 


AAACGCCGTG 


TAGGCCTGGA 




GGAGCCGAAG 


AGCAGGCGGA 


CCCCCTCCGC 


GGGGGAACAG 


TTTCCGCCGG 


GAGCACAAAG 




CAACGGACCG 


GAAGTGGGGG 


GCGGAAGTGC 


AGTGGGCTCA 


GCGCCGACTG 


CGCGCCTCTG 




CCCGCGAAAA 


CTCTGAGCTG 


GCTGACAGCT 


GGGGACGGGT 


GGCGGCCCTC 


GACTGGAGTC 




GGTTGAGTTC 


CTGAGGGACC 


CCGGTTCTGG 


AAGGTTCGCC 


GCGGAGACAA 


GTGAGCAGTC 


15 


TGTGCCATAG 


GGATTCTCGA 


AGAGAACAGC 


GTTGTGTCCC 


AGTGCACATG 


CTCGCATCGC 




TTACCAGGAG 


TGCCCGAGAC 


CCTAAGATGT 


TCGGAGTGGT 


TTTTTCGCAC 


AGACCCGAAT 




AGCCTGCCCC 


TCAGCCACGC 


TCTGTGCCCT 


TCTGAGAACA 


GGCTGATATG 


CCCAAGATAG 




TCCTGAATGG 


TGTGACCGTA 


GACTTCCCTT 


TCCAGCCCTA 


CAAATGCCAA 


CAGGAGTACA 




TGACCAAGGT 


CCTGGAATGT 


CTGCAGCAGA 


AGGTGAATGG 


CATCCTGGAG 


AGCCCTACGG 


20 


GTACAGGGAA 


GACGCTGTGC 


CTGCTGTGCA 


CCACGCTGGC 


CTGGCGAGAA 


CACCTCCGAG 




ACGGCATCTC 


TGCCCGCAAG 


ATTGCCGAGA 


GGGCGCAAGG 


AGAGCTTTTC 


CCGGATCGGG 




CCTTGTCATC 


CTGGGGCAAC 


GCTGCTGCTG 


CTGCTGGAGA 


CCCCATAGCT 


TGCTACACGG 




ACATCCCAAA 


GATTATTTAC 


GCCTCCAGGA 


CCCACTCGCA 


ACTCACACAG 


GTCATCAACG 




AGCTTCGGAA 


CACCTCCTAC 


CGGCCTAAGG 


TGTGTGTGCT 


GGGCTCCCGG 


GAGCAGCTGT 


25 


GCATCCATCC 


TGAGGTGAAG 


AAACAAGAGA 


GTAACCATCT 


ACAGATCCAC 


TTGTGCCGTA 




AGAAGGTGGC 


AAGTCGCTCC 


TGTCATTTCT 


ACAACAACGT 


AGAAGAAAAA 


AGCCTGGAGC 




AGGAGCTGGC 


CAGCCCCATC 


CTGGACATTG 


AGGACTTGGT 


CAAGAGCGGA 


AGCAAGCACA 




GGGTGTGCCC 


TTACTACCTG 


TCCCGGAACC 


TGAAGCAGCA 


AGCCGACATC 


ATATTCATGC 




CGTACAATTA 


CTTGTTGGAT 


GCCAAGAGCC 


GCAGAGCACA 


CAACATTGAC 


CTGAAGGGGA 


30 


CAGTCGTGAT 


CTTTGACGAA 


GCTCACAACG 


TGGAGAAGAT 


GTGTGAAGAA 


TCGGCATCCT 




TTGACCTGAC 


TCCCCATGAC 


CTGGCTTCAG 


GACTGGACGT 


CATAGACCAG 


GTGCTGGAGG 




AGCAGACCAA 


GGCAGCGCAG 


CAGGGTGAGC 


CCCACCCGGA 


GTTCAGCGCG 


GACTCCCCCA 




GCCCAGGGCT 


GAACATGGAG 


CTGGAAGACA 


TTGCAAAGCT 


GAAGATGATC 


CTGCTGCGCC 




TGGAGGGGGC 


CATCGATGCT 


GTTGAGCTGC 


CTGGAGACGA 


CAGCGGTGTC 


ACCAAGCCAG 
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GGAGCTACAT 


CTTTGAGCTG 


TTTGCTGAAG 


CCCAGATCAC 


GTTTCAGACC 


AAGGGCTGCA 




TCCTGGACTC 


GCTGGACCAG 


ATCATCCAGC 


ACCTGGCAGG 


ACGTGCTGGA 


GTGTTCACCA 




ACACGGCCGG 


ACTGCAGAAG 


CTGGCGGACA 


TT ATC C AG AT 


TGTGTTCAGT 


GTGGACCCCT 




CCGAGGGCAG 


CCCTGGTTCC 


CCAGCAGGGC 


TGGGGGCCTT 


ACAGTCCTAT 


AAGGTGCACA 


5 


TCCATCCTGA 


TGCTGGTCAC 


CGGAGGACGG 


CTCAGCGGTC 


TGATGCCTGG 


AGCACCACTG 




CAGCCAGAAA 


GCGAGGGAAG 


GTGCTGAGCT 


ACTGGTGCTT 


CAGTCCCGGC 


CACAGCATGC 




ACGAGCTGGT 


CCGCCAGGGC 


GTCCGCTCCC 


TCATCCTTAC 


CAGCGGCACG 


CTGGCCCCGG 




TGTCCTCCTT 


TGCTCTGGAG 


ATGCAGATCC 


CTTTCCCAGT 


CTGCCTGGAG 


AACCCACACA 




TCATCGACAA 


GCACCAGATC 


TGGGTGGGGG 


TCGTCCCCAG 


AGGCCCCGAT 


GGAGCCCAGT 


10 


TGAGCTCCGC 


GTTTGACAGA 


CGGTTTTCCG 


AGGAGTGCTT 


ATCCTCCCTG 


GGGAAGGCTC 




TGGGCAACAT 


CGCCCGCGTG 


GTGCCCTATG 


GGCTCCTGAT 


CTTCTTCCCT 


TCCTATCCTG 




TCATGGAGAA 


GAGCCTGGAG 


TTCTGGCGGG 


CCCGCGACTT 


GGCCAGGAAG 


ATGGAGGCGC 




TGAAGCCGCT 


GTTTGTGGAG 


CCCAGGAGCA 


AAGGCAGCTT 


CTCCGAGACC 


ATCAGTGCTT 




ACTATGCAAG 


GGTTGCCGCC 


CCTGGGTCCA 


CCGGCGCCAC 


CTTCCTGGCG 


GTCTGCCGGG 


15 


GCAAGGCCAG 


CGAGGGGCTG 


GACTTCTCAG 


ACACGAATGG 


CCGTGGTGTG 


ATTGTCACGG 




GCCTCCCGTA 


CCCCCCACGC 


ATGGACCCCC 


GGGTTGTCCT 


CAAGATGCAG 


TTCCTGGATG 




AGATGAAGGG 


CCAGGGTGGG 


GCTGGGGGCC 


AGTTCCTCTC 


TGGGCAGGAG 


TGGTACCGGC 




AGCAGGCGTC 


CAGGGCTGTG 


AACCAGGCCA 


TCGGGCGAGT 


GATCCGGCAC 


CGCCAGGACT 




ACGGAGCTGT 


CTTCCTCTGT 


GACCACAGGT 


TCGCCTTTGC 


CGACGCAAGA 


GCCCAACTGC 


20 


CCTCCTGGGT 


GCGTCCCCAC 


GTCAGGGTGT 


ATGACAACTT 


TGGCCATGTC 


ATCCGAGACG 




TGGCCCAGTT 


CTTCCGTGTT 


GCCGAGCGAA 


CTATGCCAGC 


GCCGGCCCCC 


CGGGCTACAG 




CACCCAGTGT 


GCGTGGAGAA 


GATGCTGTCA 


GCGAGGCCAA 


GTCGCCTGGC 


CCCTTCTTCT 




CCACCAGGAA 


AGCTAAGAGT 


CTGGACCTGC 


ATGTCCCCAG 


CCTGAAGCAG 


AGGTCCTCAG 




GGTCACCAGC 


TGCCGGGGAC 


CCCGAGAGTA 


GCCTGTGTGT 


GGAGTATGAG 


CAGGAGCCAG 


25 


TTCCTGCCCG 


GCAGAGGCCC 


AGGGGGCTGC 


TGGCCGCCCT 


GG AGC AC AGC 


GAAC AGC GGG 




CGGGGAGCCC 


TGGCGAGGAG 


CAGGCCCACA 


GCTGCTCCAC 


CCTGTCCCTC 


CTGTCTGAGA 




AGAGGCCGGC 


AGAAGAACCG 


CGAGGAGGGA 


GGAAGAAGAT 


CCGGCTGGTC 


AGCCACCCGG 




AGGAGCCCGT 


GGCTGGTGCA 


CAGACGGACA 


GGGCCAAGCT 


CTTCATGGTG 


GCCGTGAAGC 




AGGAGTTGAG 


CCAAGCCAAC 


TTTGCCACCT 


TCACCCAGGC 


CCTGCAGGAC 


TACAAGGGTT 


30 


CCGATGACTT 


CGCCGCCCTG 


GCCGCCTGTC 


TCGGCCCCCT 


CTTTGCTGAG 


GACCCCAAGA 




AGCACAACCT 


GCTCCAAGGC 


TTCTACCAGT 


TTGTGCGGCC 


CCACCATAAG 


CAGCAGTTTG 




AGGAGGTCTG 


TATCCAGCTG 


ACAGGACGAG 


GCTGTGGCTA 


TCGGCCTGAG 


CACAGCATTC 




CCCGAAGGCA 


GCGGGCACAG 


CCGGTCCTGG 


ACCCCACTGG 


AAGAACGGCG 


CCGGATCCCA 




AGCTGACCGT 


GTCCACGGCT 


GCAGCCCAGC 


AGCTGGACCC 


CCAAGAGCAC 


CTGAACCAGG 
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GCAGGCCCCA 


CCTGTCGCCC 


AGGCCACCCC 


CAACAGGAGA 


CCCTGGCAGC 


CAACCACAGT 


GGGGGTCTGG 


AGTGCCCAGA 


GCAGGGAAGC 


AGGGCCAGCA 


CGCCGTGAGC 


GCCTACCTGG 


CTGATGCCCG 


CAGGGCCCTG 


GGGTCCGCGG 


GCTGTAGCCA 


ACTCTTGGCA 


GCGCTGACAG 


CCTATAAGCA 


AGACGACGAC 


CTCGACAAGG 


TGCTGGCTGT 


GTTGGCCGCC 


CTGACCACTG 


CAAAGCCAGA 


GGACTTCCCC 


CTGCTGCACA 


GGTTCAGCAT 


GTTTGTGCGT 


CCACACCACA 


AGCAGCGCTT 


CTCACAGACG 


TGCACAGACC 


TGACCGGCCG 


GCCCTACCCG 


GGCATGGAGC 


CACCGGGACC 


CCAGGAGGAG 


AGGCTTGCCG 


TGCCTCCTGT 


GCTTACCCAC 


AGGGCTCCCC 


AACCAGGCCC 


CTCACGGTCC 


GAGAAGACCG 


GGAAGACCCA 


GAGCAAGATC 


TCGTCCTTCC 


TTAGACAGAG 


GCCAGCAGGG 


ACTGTGGGGG 


CGGGCGGTGA 


GGATGCAGGT 


CCCAGCCAGT 


CCTCAGGACC 


TCCCCACGGG 


CCTGCAGCAT 


CTGAGTGGGG 


CCTCTAGGAT 


GTGCCCAGCC 


TGCCACACCG 


CCTCCAGGAA 


GCAGAGCGTC 


ATGCAGGTCT 


TCTGGCCAGA 


GCCCCAGTGA 


GTGCCCACGG 


AGGCCCCCAG 


CACACCCAAC 


GTGGCTTGAT 


CACCTGCCTG 


TCCAGCTCTG 


GTGGGCCAAG 


AACCCACCCA 


ACAGAATAGG 


CCAGCCCATG 


CCAGCCGGCT 


TGGCCCGCTG 


CAGGCCTCAG 


GCAGGCGGGG 


CCCATGGTTG 


GTCCCTGCGG 


TGGGACCGGA 


TCTGGGCCTG 


CCTCTGAGAA 


GCCCTGAGCT 


ACCTTGGGGT 


CTGGGGTGGG 


TTTCTGGGAA 


AGTGCTTCCC 


CAGAACTTCC 


CTGGCTCCTG 


GCCTGTGAGT 


GGTGCCACAG 


GGGCACCCCA 


GCTGAGCCCC 


TCACCGGGAA 


GGAGGAGACC 


CCCGTGGGCA 


CGTGTCCACT 


TTTAATCAGG 


GGACAGGGCT 


CTCTAATAAA 


GCTGCTGGCA 


GTGCCC (SEQ ID NO:l) . 







The above-exemplified isolated DNA molecule shown in Figure 1 A-B and 
SEQ ID NO:l comprise 4946 nucleotides, with an initiating Met at nucleotides 828- 
830 and a "TAG" termination codon at nucleotides 4585-4587. The initiating Met and 
TAG termination codon are underlined. 

The present invention also relates to biologically active fragments or mutants 
of SEQ ID NO: 1 which encode a mRNA molecule expressing a novel DNA helicase, 
NHL. Any such biologically active fragment and/or mutant will encode either a 
protein or protein fragment which at least substantially mimics the biological 
properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 
SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to 
nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy- 
terminal truncations such that these mutations encode mRNA which express a 
functional NHL protein in a host cell, so as to be useful for screening for agonists 
and/or antagonists of NHL activity. 

The isolated nucleic acid molecules of the present invention may include a 
deoxyribonucleic acid molecule (DNA), such as genomic DNA and complementary 
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DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, 

as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The 

isolated nucleic acid molecule of the present invention may also include a ribonucleic 

acid molecule (RNA). 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which contain the substantially purified 

nucleic acid molecules disclosed throughout this specification. 

The degeneracy of the genetic code is such that, for all but two amino 

acids, more than a single codon encodes a particular amino acid. This allows for 
10 the construction of synthetic DNA that encodes the NHL protein where the 

nucleotide sequence of the synthetic DNA differs significantly from the nucleotide 

sequence of SEQ ID NO:l but still encodes the same NHL protein as SEQ ID 

NO:2. Such synthetic DNAs are intended to be within the scope of the present 

invention. If it is desired to express such synthetic DNAs in a particular host cell 
15 or organism, the codon usage of such synthetic DNAs can be adjusted to reflect 

the codon usage of that particular host, thus leading to higher levels of expression 

of the NHL protein in the host. In other words, this redundancy in the various 

codons which code for specific amino acids is within the scope of the present 

invention. Therefore, this invention is also directed to those DNA sequences 
20 which encode RNA comprising alternative codons which code for the eventual 

translation of the identical amino acid, as shown below: 

A=Ala=AIanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 

D=Asp=Aspartic acid: codons GAC, GAU 
25 E=Glu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His =Histidine: codons CAC, CAU 

I=Ile =JsoIeucine: codons AUA, AUC, AUU 
30 K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asp=Asparagine: codons AAC, AAU 

P=Pro=Proline: codons CCA, CCC, CCG, CCU 
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Q=Gln=Glutamine: codons CAA, CAG 
R-Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 
T=Thr=Threonine: codons ACA, ACC, ACG, ACU 
5 V=Val=Valine: codons GUA, GUC, GUG, GUU 
W=Trp=Tryptophan: codon UGG 
Y=Tyn=Tyrosine: codons UAC, UAU 

Therefore, the present invention discloses codon redundancy which may result in 
differing DNA molecules expressing an identical protein. For purposes of this 

10 specification, a sequence bearing one or more replaced codons will be defined as a 
degenerate variation. Also included within the scope of this invention are 
mutations either in the DNA sequence or the translated protein which do not 
substantially alter the ultimate physical properties of the expressed protein. For 
example, substitution of valine for leucine, arginine for lysine, or asparagine for 

15 glutamine may not cause a change in functionality of the polypeptide. 

It is known that DNA sequences coding for a peptide may be altered so as 
to code for a peptide having properties that are different than those of the naturally 
occurring peptide. Methods of altering the DNA sequences include but are not 
limited to site directed mutagenesis. Examples of altered properties include but 

20 are not limited to changes in the affinity of an enzyme for a substrate or a receptor 
for a ligand. 

The present invention also relates to recombinant vectors and recombinant 
hosts, both prokaryotic and eukaryotic, which contain the substantially purified 
nucleic acid molecules disclosed throughout this specification. The nucleic acid 

25 molecules of the present invention encoding a NHL protein, in whole or in part, can 
be linked with other DNA molecules, i.e, DNA molecules to which the NHL coding 
sequence are not naturally linked, to form "recombinant DNA molecules" which 
encode a respective NHL protein. The novel DNA sequences of the present invention 
can be inserted into vectors which comprise nucleic acids encoding NHL or a 

30 functional equivalent. These vectors may be comprised of DNA or RNA; for most 
cloning purposes DNA vectors are preferred. Typical vectors include plasmids, 
modified viruses, bacteriophage, cosmids, yeast artificial chromosomes, and other 
forms of episomal or integrated DNA that can encode a NHL protein. It is well within 
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the purview of the skilled artisan to determine an appropriate vector for a particular 
gene transfer or other use. 

Included in the present invention are DNA sequences that hybridize to SEQ ID 
NO:l under stringent conditions. By way of example, and not limitation, a procedure 

5 using conditions of high stringency is as follows: Prehybridization of filters containing 
DNA is carried out for 2 hours to overnight at 65°C in buffer composed of 6X SSC, 
5X Denhardt's solution, and 100 /xg/ml denatured salmon sperm DNA. Filters are 
hybridized for 12 to 48 hrs at 65°C in prehybridization mixture containing 100 jxg/ml 
denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-Iabeled probe. Washing of 

10 filters is done at 37°C for 1 hr in a solution containing 2X SSC, 0.1% SDS. This is 
followed by a wash in 0.1X SSC, 0.1% SDS at 50°C for 45 min. before 
autoradiography. Other procedures using conditions of high stringency would include 
either a hybridization step carried out in 5XSSC, 5X Denhardt's solution, 50% 
formamide at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 

15 0.2% SDS at 65°C for 30 to 60 minutes. 

Reagents mentioned in the foregoing procedures for carrying out high 
stringency hybridization are well known in the art. Details of the composition of these 
reagents can be found in, e.g., Sambrook et al., 1989, Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. 

20 In addition to the foregoing, other conditions of high stringency which may be used 
are well known in the art. 

The present invention also relates to a substantially purified form of a human 
NHL protein which comprises the amino acid sequence (1219 amino acid residues) 
disclosed in Figure 2 and set forth as SEQ ID NO:2. A preferred aspect of this portion 

25 of the present invention is a NHL protein which consists of the amino acid sequence 
disclosed in Figure 2 and set forth as SEQ ID NO:2, as follows: 



MPKIVLNGVT 


VDFPFQPYKC 


QQEYMTKVLE 


CLQQKVNGIL 


ESPTGTGKTL 


CLLCTTLAWR 


EHLRDGISAR 


KIAERAQGEL 


FPDRALSSWG 


NAAAAAGDPI 


ACYTDIPKII 


YASRTHSQLT 


QVINELRNTS 


YRPKVCVLGS 


REQLCIHPEV 


KKQESNHLQI 


HLCRKKVASR 


SCHFYNNVEE 


KSLEQELASP 


ILDIEDLVKS 


GSKHRVCPYY 


LSRNLKQQAD 


IIFMPYNYLL 


DAKSRRAHNI 


DLKGTWIFD 


EAHNVEKMCE 


ESASFDLTPH 


DLASGLDVID 


QVLEEQTKAA 


QQGEPHPEFS 


ADSPSPGLNM 


ELEDIAKLKM 


ILLRLEGAID 


AVELPGDDSG 


VTKPGSYIFE 


LFAEAQITFQ 


TKGCILDSLD 


QIIQHLAGRA 


GVFTNTAGLQ 


KLADIIQIVF 


SVDPSEGSPG 


SPAGLGALQS 


YKVHIHPDAG 


HRRTAQRSDA 


WSTTAARKRG 


KVLSYWCFSP 


GHSMHELVRQ 


GVRSLILTSG 
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TLAPVSSFAL 


EMQIPFPVCL 


ENPHIIDKHQ 


IWVGWPRGP 


DGAQLSSAFD 


RRFSEECLSS 


LGKALGNIAR 


WPYGLLIFF 


PSYPVMEKSL 


EFWRARDLAR 


KMEALKPLFV 


EPRSKGSFSE 


TISAYYARVA 


APGSTGATFL 


AVCRGKASEG 


LDFSDTNGRG 


VIVTGLPYPP 


RMDPRWLKM 


QFLDEMKGQG 


GAGGQFLSGQ 


EWYRQQASRA 


VNQAIGRVIR 


HRQDYGAVFL 


CDHRFAFADA 


RAQLPSWVRP 


HVRVYDNFGH 


VIRDVAQFFR 


VAERTMPAPA 


PRATAPSVRG 


EDAVSEAKSP 


GPFFSTRKAK 


SLDLHVPSLK 


QRSSGSPAAG 


DPESSLCVEY 


EQEPVPARQR 


PRGLLAALEH 


SEQRAGSPGE 


EQAHSCSTLS 


LLSEKRPAEE 


PRGGRKKIRL 


VSHPEEPVAG 


AQTDRAKLFM 


VAVKQELSQA 


NFATFTQALQ 


DYKGSDDFAA 


LAACLGPLFA 


EDPKKHNLLQ 


GFYQFVRPHH 


KQQFEEVCIQ 


LTGRGCGYRP 


EHSIPRRQRA 


QPVLDPTGRT 


APDPKLTVST 


AAAQQLDPQE 


HLNQGRPHLS 


PRPPPTGDPG 


SQPQWGSGVP 


RAGKQGQHAV 


SAYLADARRA 


LGSAGCSQLL 


AALTAYKQDD 


DLDKVLAVLA 


ALTTAKPEDF 


PLLHRFSMFV 


RPHHKQRFSQ 


TCTDLTGRPY 


PGMEPPGPQE 


ERLAVPPVLT 


HRAPQPGPSR 


SEKTGKTQSK 


ISSFLRQRPA 


GTVGAGGEDA 


GPSQSSGPPH 


GPAASEWGL* 


(SEQ ID NO; 


:2> . 







The present invention also relates to biologically active fragments and/or 

15 mutants of the human NHL protein comprising the amino acid sequence as set forth in 
SEQ ID NO:2, including but not necessarily limited to amino acid substitutions, 
deletions, additions, amino terminal truncations and carboxy-terminal truncations such 
that these mutations provide for proteins or protein fragments of diagnostic, 
therapeutic or prophylactic use and would be useful for screening for agonists and/or 

20 antagonists of NHL function. 

Another preferred aspect of the present invention relates to a substantially 
purified, fully processed NHL protein obtained from a recombinant host cell 
containing a DNA expression vector which comprises a nucleotide sequence as set 
forth in SEQ ID NO:l and expresses the human NHL protein. It is especially 

25 preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

As with many proteins, it is possible to modify many of the amino acids of 
NHL protein and still retain substantially the same biological activity as the wild type 
protein. Thus this invention includes modified NHL polypeptides which have amino 

30 acid deletions, additions, or substitutions but that still retain substantially the same 
biological activity as a respective, corresponding NHL. It is generally accepted that 
single amino acid substitutions do not usually alter the biological activity of a protein 
(see, e.g., Molecular Biology of the Gene, Watson et al, 1987, Fourth Ed., The 
Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells, 1989, 
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Science 244:1081-1085). Accordingly, the present invention includes a polypeptide 
where one amino acid substitution has been made in SEQ ID NO:2 wherein the 
polypeptide still retains substantially the same biological activity as a corresponding 
NHL protein. The present invention also includes polypeptides where two or more 
5 amino acid substitutions have been made in SEQ ID NO:2 wherein the polypeptide 
still retains substantially the same biological activity as a corresponding NHL protein. 
In particular, the present invention includes embodiments where the above-described 
substitutions are conservative substitutions. 

One skilled in the art would also recognize that polypeptides that are 

10 functional equivalents of NHL and have changes from the NHL amino acid sequence 
that are small deletions or insertions of amino acids could also be produced by 
following the same guidelines, (i.e, minimizing the differences in amino acid 
sequence between NHL and related proteins. Small deletions or insertions are 
generally in the range of about 1 to 5 amino acids). The effect of such small deletions 

15 or insertions on the biological activity of the modified NHL polypeptide can easily be 
assayed by producing the polypeptide synthetically or by making the required changes 
in DNA encoding NHL and then expressing the DNA recombinantly and assaying the 
protein produced by such recombinant expression. 

The present invention also includes truncated forms of NHL which contain the 

20 region comprising the active site of the enzyme. Such truncated proteins are useful in 
various assays described herein, for crystallization studies, and for structure-activity- 
relationship studies. 

The present invention also relates to isolated nucleic acid molecules which are 
fusion constructions expressing fusion proteins useful in assays to identify compounds 

25 which modulate wild-type NHL activity, as well as generating antibodies against 
NHL. One aspect of this portion of the invention includes, but is not limited to, 
glutathione S-transferase (GST)-NHL fusion constructs. Recombinant GST-NHL 
fusion proteins may be expressed in various expression systems, including Spodoptera 
frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus expression vector 

30 (pAcG2T, Pharmingen). Another aspect involves NHL fusion constructs linked to 
various markers, including but not limited to GFP (Green fluorescent protein), the 
MYC epitope, and GST. Again, any such fusion constructs may be expressed in the 
cell line of interest and used to screen for modulators of one or more of the NHL 
proteins disclosed herein. 
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Any of a variety of procedures may be used to clone NHL. These methods 
include, but are not limited to, (1) a RACE PCR cloning technique (Frohman, et al., 
1988, Proc. Natl. Acad. ScL USA 85: 8998-9002). 5' and/or 3' RACE may be 
performed to generate a full-length cDNA sequence. This strategy involves using 

5 gene-specific oligonucleotide primers for PCR amplification of NHL cDNA. These 
gene-specific primers are designed through identification of an expressed sequence 
tag (EST) nucleotide sequence which has been identified by searching any number of 
publicly available nucleic acid and protein databases; (2) direct functional expression 
of the NHL cDNA following the construction of a NHL-containing cDNA library in 

10 an appropriate expression vector system; (3) screening a NHL-containing cDNA 
library constructed in a bacteriophage or plasmid shuttle vector with a labeled 
degenerate oligonucleotide probe designed from the amino acid sequence of the NHL 
protein; (4) screening a NHL-containing cDNA library constructed in a bacteriophage 
or plasmid shuttle vector with a partial cDNA encoding the NHL protein. This partial 

15 cDNA is obtained by the specific PCR amplification of NHL DNA fragments through 
the design of degenerate oligonucleotide primers from the amino acid sequence 
known for other kinases which are related to the NHL protein; (5) screening a NHL- 
containing cDN A library constructed in a bacteriophage or plasmid shuttle vector with 
a partial cDNA or oligonucleotide with homology to a mammalian NHL protein. This 

20 strategy may also involve using gene-specific oligonucleotide primers for PCR 
amplification of NHL cDNA identified as an EST as described above; or 
(6) designing 5' and 3' gene specific oligonucleotides using SEQ ID NO: 1 as a 
template so that either the full-length cDNA may be generated by known RACE 
techniques, or a portion of the coding region may be generated by these same known 

25 RACE techniques to generate and isolate a portion of the coding region to use as a 
probe to screen one of numerous types of cDNA and/or genomic libraries in order to 
isolate a full-length version of the nucleotide sequence encoding NHL. 

It is readily apparent to those skilled in the art that other types of libraries, as 
well as libraries constructed from other cell types-or species types, may be useful for 

30 isolating a NHL-encoding DNA or a NHL homologue. Other types of libraries 
include, but are not limited to, cDNA libraries derived from other cells. 

It is readily apparent to those skilled in the art that suitable cDNA libraries 
may be prepared from cells or cell lines which have NHL activity. The selection of 
cells or cell lines for use in preparing a cDNA library to isolate a cDNA encoding 
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NHL may be done by first measuring cell-associated NHL activity using any known 
assay available for such a purpose. 

Preparation of cDNA libraries can be performed by standard techniques well 
known in the art. Well known cDNA library construction techniques can be found for 

5 example, in Sambrook et a!., 1989, Molecular Cloning: A Laboratory Manual; Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York. Complementary DNA 
libraries may also be obtained from numerous commercial sources, including but not 
limited to Clontech Laboratories, Inc. and Stratagene. 

It is also readily apparent to those skilled in the art that DNA encoding NHL 

10 may also be isolated from a suitable genomic DNA library. Construction of genomic 
DNA libraries can be performed by standard techniques well known in the art. Well 
known genomic DNA library construction techniques can be found in Sambrook, et 
al., supra. One may prepare genomic libraries, especially in PI artificial chromosome 
vectors, from which genomic clones containing the NHL gene can be isolated, using 

15 probes based upon the NHL nucleotide sequences disclosed herein. Methods of 
preparing such libraries are known in the art (Ioannou et al., 1994, Nature Genet. 
6:84-89). 

In order to clone a NHL gene by one of the preferred methods, the amino acid 
sequence or DNA sequence of a NHL or a homologous protein may be necessary. To 

20 accomplish this, a respective NHL protein may be purified and the partial amino acid 
sequence determined by automated sequenators. It is not necessary to determine the 
entire amino acid sequence, but the linear sequence of two regions of 6 to 8 amino 
acids can be determined for the PCR amplification of a partial NHL DNA fragment. 
Once suitable amino acid sequences have been identified, the DNA sequences capable 

25 of encoding them are synthesized. Because the genetic code is degenerate, more than 
one codon may be used to encode a particular amino acid, and therefore, the amino 
acid sequence can be encoded by any of a set of similar DNA oligonucleotides. Only 
one member of the set will be identical to the NHL sequence but others in the set will 
be capable of hybridizing to NHL DNA even in the presence of DNA 

30 oligonucleotides with mismatches. The mismatched DNA oligonucleotides may still 
sufficiently hybridize to the NHL DNA to permit identification and isolation of NHL 
encoding DNA. Alternatively, the nucleotide sequence of a region of an expressed 
sequence may be identified by searching one or more available genomic databases. 
Gene-specific primers may be used to perform PCR amplification of a cDNA of 
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interest from either a cDNA library or a population of cDNAs. As noted above, the 
appropriate nucleotide sequence for use in a PCR-based method may be obtained 
from SEQ ID NO: 1 either for the purpose of isolating overlapping 5' and 3' RACE 
products for generation of a full-length sequence coding for NHL, or to isolate a 

5 portion of the nucleotide sequence coding for NHL for use as a probe to screen one or 
more cDNA- or genomic-based libraries to isolate a full-length sequence encoding 
NHL or NHL-like proteins. 

This invention also includes vectors containing a NHL gene, host cells 
containing the vectors, and methods of making substantially pure NHL protein 

10 comprising the steps of introducing the NHL gene into a host cell, and cultivating the 
host cell under appropriate conditions such that NHL is produced. The NHL so 
produced may be harvested from the host cells in conventional ways. Therefore, the 
present invention also relates to methods of expressing the NHL protein and 
biological equivalents disclosed herein, assays employing these gene products, 

15 recombinant host cells which comprise DNA constructs which express these proteins, 
and compounds identified through these assays which act as agonists or antagonists of 
NHL activity. 

The cloned NHL cDNA obtained through the methods described above may 
be recombinantly expressed by molecular cloning into an expression vector (such as 

20 pcDNA3.neo, pcDNA3.1, pCR2.1, pBIueBacHis2 or pLITMUS28) containing a 
suitable promoter and other appropriate transcription regulatory elements, and 
transferred into prokaryotic or eukaryotic host cells to produce recombinant NHL. 
Expression vectors are defined herein as DNA sequences that are required for the 
transcription of cloned DNA and the translation of their mRNAs in an appropriate 

25 host. Such vectors can be used to express eukaryotic DNA in a variety of hosts such 
as bacteria, blue green algae, plant cells, insect cells and animal cells. Specifically 
designed vectors allow the shuttling of DNA between hosts such as bacteria-yeast or 
bacteria-animal cells. An appropriately constructed expression vector should contain: 
an origin of replication for autonomous replication in host cells, selectable markers, a 

30 limited number of useful restriction enzyme sites, a potential for high copy number, 
and active promoters. A promoter is defined as a DNA sequence that directs RNA 
polymerase to bind to DNA and initiate RNA synthesis. A strong promoter is one 
which causes mRNAs to be initiated at high frequency. To determine the NHL cDNA 
sequence(s) that yields optimal levels of NHL, cDNA molecules including but not 
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limited to the following can be constructed: a cDNA fragment containing the full- 
length open reading frame for NHL as well as various constructs containing portions 
of the cDNA encoding only specific domains of the protein or rearranged domains of 
the protein. All constructs can be designed to contain none, all or portions of the 5' 

5 and/or 3' untranslated region of a NHL cDNA. The expression levels and activity of 
NHL can be determined following the introduction, both singly and in combination, 
of these constructs into appropriate host cells. Following determination of the NHL 
cDNA cassette yielding optimal expression in transient assays, this NHL cDNA 
construct is transferred to a variety of expression vectors (including recombinant 

10 viruses), including but not limited to those for mammalian cells, plant cells, insect 
cells, oocytes, bacteria, and yeast cells. Techniques for such manipulations can be 
found described in Sambrook, et al., supra, are well known and available to the 
artisan of ordinary skill in the art. Therefore, another aspect of the present invention 
includes host cells that have been engineered to contain and/or express DNA 

15 sequences encoding the NHL protein. An expression vector containing DNA 

encoding a NHL-like protein may be used for expression of NHL in a recombinant 
host cell. Such recombinant host cells can be cultured under suitable conditions to 
produce NHL or a biologically equivalent form. Expression vectors may include, but 
are not limited to, cloning vectors, modified cloning vectors, specifically designed 

20 plasmids or viruses. Commercially available mammalian expression vectors which 
may be suitable for recombinant NHL expression, include but are not limited to, 
pcDNA3.neo (Invitrogen), pcDNA3.1 (Invitrogen), pCI-neo (Promega), pLITMUS28, 
pUTMUS29, pLTTMUS38 and pLITMUS39 (New England Bioloabs), pcDNAI, 
pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 

25 (Stratagene), pSG5 (Stratagene), EBO-pS V2-neo (ATCC 37593) pBP V- 1 (8-2) 

(ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and 
1ZD35 (ATCC 37565). Also, a variety of bacterial expression vectors may be used to 
express recombinant NHL in bacterial cells. Commercially available bacterial 

30 expression vectors which may be suitable for recombinant NHL expression include, 
but are not limited to pCR2.1 (Invitrogen), pETl la (Novagen), lambda gtl 1 
(Invitrogen), and pKK223-3 (Pharmacia). In addition, a variety of fungal cell 
expression vectors may be used to express recombinant NHL in fungal cells. 
Commercially available fungal cell expression vectors which may be suitable for 
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recombinant NHL expression include but are not limited to pYES2 (Invitrogen) and 
Pichia expression vector (Invitrogen). Also, a variety of insect cell expression vectors 
may be used to express recombinant protein in insect cells. Commercially available 
insect cell expression vectors which may be suitable for recombinant expression of 

5 NHL include but are not limited to pBlueBacHI and pBlueBacHis2 (Invitrogen), and 
pAcG2T (Pharmingen). 

Recombinant host cells may be prokaryotic or eukaryotic, including but not 
limited to, bacteria such as £. coli, fungal cells such as yeast, mammalian cells 
including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin; 

10 and insect cells including but not limited to Drosophila and silkworm derived cell 
lines. For instance, one insect expression system utilizes Spodoptera frugiperda 
(Sf21) insect cells (Invitrogen) in tandem with a baculovirus expression vector 
(pAcG2T, Pharmingen). Also, mammalian species which may be suitable and which 
are commercially available, include but are not limited to, L cells L-M(TK~) (ATCC 

15 CCL 1.3), L cells L-M (ATCC CCL 1.2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL 
1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), 
COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), 
NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS- 
C-l (ATCC CCL 26), MRC-5 (ATCC CCL 171) and CPAE (ATCC CCL 209). 

20 As disclosed in Example section 1, a 1 15 kb BAC clone (from Genome 

Systems) was subcloned and subjected to restriction and sequence analysis. Four 
genes at chromosome location 20ql3.3 were identified, including M68/DcR3, NHL, 
SCLIP and ARP (Figure 5 A). The nucleotide sequence of this BAC clone, hbml68, 
is presented as follows: 



TGAAGAGCTT 


TGACCAAGAG 


GCTGTGACGA 


GGCCCTACGA 


GGACTCTGGC 


TCTCCTCCTG 


60 


CTAAGCACAC 


CCAGGCAGGT 


GTCCTGGCAG 


ATGAGGACCA 


CATGCAGAGC 


CTCGGCCAGC 


120 


CCACCAATGC 


CCGGATATGC 


AAGTGAGCCC 


AGCCTGGACC 


CCCCGGCGAG 


GCCCAGCAGC 


180 


ACCAGCCCAG 


GCCCGAAAAC 


CTTAAGAAAT 


GACCAGTGTC 


TGCTGCTTTA 


AGCCACCAAG 


240 


CTCTGCGGTG 


GTTTGTTAGG 


CTGCAAGCAT 


GGCTAATTCA 


GAAACTGCCA 


GAAACAAGCA 


300 


CTGCTGTCCC 


CAGCCTGGGA 


CACACAGCAC 


CGCCTCTGCG 


TGGGGAGAGG 


GCACAGGCTA 


360 


AGGGCACAAA 


TGCCATCCCA 


GACCCGGCTC 


TTGTGTGTGG 


AAGGGGCCAC 


TGTGCCATGA 


420 


GGCAGAGGAA 


ACCTTGGCAG 


GACCTTATGC 


CACAGCAATT 


TAAAAGAGAA 


GAAACAGGCT 


480 


GGGCGTGGTG 


GCTCATGCCT 


ATAATCCCAG 


CACTTTGGGA 


GGCCAAGGTG 


GTGGATCACT 


540 


TGAGGTCAGG 


AGTTCAAGAC 


CAGCCTGGCC 


AATATGGTGA 


AACCCTGTCT 


CTACGAAAAA 


600 
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TACAAAATTT 


AGGCAGGCGT 


GGTGGCGGGT 


GCCTGTAATC 


CCTGCTATTC 


AGGAGGCTGA 


660 


GGCAAGAGAT 


TTACTTGAAC 


CCAGGAGGTG 


GAGGCTGCTG 


CAGTGAGCTG 


AGATCATGCC 


720 


ACTGCACTCC 


AGCCTGTGTG 


ACGGAGTGAG 


ACTTGGTCTC 


AAAAAAAAAA 


AAGGAAACAC 


780 


ATCTGACTAG 


TGTGATCTCG 


CAAGGAACAT 


TCCAGACACA 


GTGGAGCTAG 


AAGGTTCTTC 


840 


TCCAAACAAG 


GAATCCCCAG 


GGGATCAAAT 


TGTTTTGCAT 


CGGCCAGACA 


TGGTGGCTCA 


900 


AGCCTGTAAC 


CCCAGTGCTT 


CGGGAGGCTG 


AGGTGGGAGG 


ACTGCTTGAG 


TCCAGGAGTT 


960 


CAAGACTAGC 


TTGGGCAACA 


CAGTGAGAGC 


CCATTAGCCA 


GGCGTGGTGG 


CACATGCCTG 


1020 


CAGTCCCAGC 


ACTGTACTAA 


AAATCTACAC 


GGGGCCGGGC 


ATGGTGGCAC 


ATGCCTGTAG 


1080 


AGTCCCAGCT 


ACTCAGGAGG 


CTGAGGCAGG 


ACGATTCCTT 


GAACCCAGGA 


GGTCACGGCT 


1140 


GCCATGAGCC 


GTGACTGTGC 


CACTGCACTC 


CAGTCTGTGC 


AACAGAACGA 


GACTCTGTTT 


1200 


CGAAAAACAA 


AAAATCATTT 


CATGTCTCCA 


GTTTCTCCAC 


TGGCAAAAGA 


CTCTGTCAAG 


1260 


GTAAAAAATG 


GTTCTGACCC 


ACAGAAATCT 


AAGAAAGGAA 


AAAATATAAA 


AAATAGAAAA 


1320 


TTTAAAAAAG 


AGATGGTCTC 


AGAATAAAGA 


CCAACCTGGG 


CTATGGTTGT 


CACTCTTCCC 


1380 


TCACACCTTA 


GAAAGCTTTC 


TGGCCGCATC 


TGGCCAAAGG 


GCCACCCTGC 


CCCATCTTGG 


1440 


ATCAGTGAGG 


TGCCTTCGAA 


CAAGCCACCT 


GCCCTGGAGC 


CCGTCCTGTC 


TTGTCTGCCA 


1500 


CCGCACGCTC 


AGTAGGGGAG 


GGGAAGTCGC 


TAGGTTTTAG 


TTCACCAGTC 


TCTGGATCAA 


1560 


GACGTGCCAT 


AACCAAGAAG 


CCCCAGCCAC 


ACCCAGACCC 


GATGTGGCCA 


CAAGGGGTGA 


1620 


GCTGGGAAGG 


CCCAGGAAAA 


GGCGGGAGGC 


GGACGAATGG 


AAATGTCATT 


CTGTGGCCAC 


1680 


AGAAATGATC 


TCAACGTTTT 


GTAACTTCCT 


ACCAAGAGGC 


AGTCTTAGCT 


CTGCCCTTGA 


1740 


ACCAGCACTT 


GGTGATGTCG 


CTTGCGTCAA 


TCAAGGCAAC 


AGAAGTGAGC 


AGGAGGCCCA 


1800 


CTTTCCTCTG 


CAACTGTGGG 


CTTACGGGGC 


AAAGAAGTCC 


AGGCCTCCAG 


GTGGAGGATC 


1860 


ACAGACCGGG 


CAAAGCAGAG 


GAGAGCCACC 


CAGCCGAGCC 


TACCTGTGCC 


TCAGACTGCC 


1920 


TCCCTCCAGA 


GACCCCTGTG 


GCCAAGGCCA 


CCCAGACCAG 


CAGGTCCTTG 


CCAAGCTGTC 


1980 


AGCTGACGAC 


AGGGGTTGGT 


GAGGCCGGCC 


CAGACCAGCA 


GAACCACGAA 


CCAACCAACA 


2040 


GAATTAAAAA 


TAATAACAAC 


TATGTCTTGT 


CTTAAGCCAC 


TAAGTTTTGG 


ATGGTTTCTT 


2100 


TCTTTCTTTT 


TCTTTTTTTT 


TTTCGGAGAC 


GCAGTCTCAC 


TCTGTTGCCC 


AGGCTGGAGT 


2160 


GCAGTGGCGC 


AATCTTGGCT 


CACTGCAAGC 


TCTGCCCCCC 


GGATTCACGC 


CATTCCCCTG 


2220 


CCTCAGCCTC 


CTGAGTAACT 


GGGACTACAG 


GTGCCTGCCA 


TTGGGTGTTT 


TCTTAAACAG 


2280 


CAAAAGAAAA 


CTGACACAAT 


CATAAACAGA 


GCAAGCAAGA 


GAACTTGGCA 


ATTATTTCCT 


2340 


CTCTACTTCT 


CACTGTTCTT 


CAAAGAGTTA 


ACTCAAGCAT 


AAGATGTGAG 


CAAATTCTTT 


2400 


TAACATCCTA 


GAAAAAAAGC 


TCCTACTCAG 


TGTTCATAAA 


GCAAAGCTAA 


CCTACAGGAG 


2460 


CCACCTTCCA 


CAGTGACCAC 


AGGAAACCAA 


GACAGCAAGT 


GGGACACCAG 


CCTCCAGGGC 


2520 


ACTGCGCCAG 


CCGTGCGCCT 


GTGTCTGCCA 


CTGCCCTGGT 


CCGTCACTGC 


CACCAGCCGG 


2580 


CAAGACACCC 


ACAGAGGAGA 


GCTCTAAGCC 


ACAACTGTGT 


ACGAAGACAA 


CTGTGCAGGA 


2640 
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TTTTATTACT 


ACAACATTTT 


TGTTTTCTTT 




TTTGAGACTG 


AGTCTCGCTC 


2700 




TGTCACCCAG 


GCTGGAGTGC 


AGTGGCACAA 


TCTCGGCTCA 


CTGTAACCTC 


CATCTCCCTG 


2760 




GTTCAAGCAA 


TTCTCCTGCT 


GCAGCCTCCC 


AACTGGATTA 


CAGGCGCCCG 


CCACCACGCC 


2820 




TGGCTAATTT 


TTGTACTTTT 


AGTAGAGATG 


GGGTTTCACC 


ATGTTGGCCA 


GACTGGTCTC 


2880 


5 


AAATTCCTGA 


CAAGTGATCC 


ACCCACCCTG 


GCCTCCCAAA 


GTGCTGGGAT 


TACAGGTGTG 


2940 




AGCCACTGCG 


CCTGGCCCAT 


TTTTGTTTAT 


CAATAAAAAT 


GTACTTAATG 


TTGAACTCTC 


3000 




CACATTTCAA 


ATGGGTAACT 


CCAGTGTCCT 


TGATGCTCCT 


GCGACATGTT 


CGTGAGACTT 


3060 




CTCTTGGGTG 


TGAGAGTCTA 


GCATGTGGGT 


GGTCTGGACA 


GGAGGGGGAG 


GGAAGAGTGC 


3120 




AGAGCCGGGC 


AGGGTAAAGA 


GACCCCCTAG 


GATGTGAAGG 


CCGCCCTGCA 


TTTGTCAGAC 


3180 


10 


TGGGCAACAC 


CCACTCCATC 


AGATGGACCC 


TGGTATGGGC 


GGCAAGCCAC 


CTAGGTGCCG 


3240 




AGGCAAGAGA 


CCGAGGGCAC 


GAGCTGTTCC 


GGTGTAATAA 


AATGCATAAA 


ATAAGAATAG 


3300 




TTATACTAGA 


TATAGATCAT 


AAATATGATT 


ATATATGAAT 


ATCATTCATC 


ATTAGTTTGT 


3360 




AGCAATTACT 


CTTTATTCCA 


ATATTATAAT 


AATCCTTGCC 


TAAGCATAAC 


CTAGGAAAAA 


3420 




CTAGGAAATC 


ATAACCTAGG 


AAAAACTAGG 


CCATACAGAG 


ATAGGAGCTG 


AGGGGACATA 


3480 


15 


GTGAGAACTG 


ACCAGAAGAC 


AAGAGTGCGA 


GCCTTCTGTT 


ATGCCTGGAC 


AGGGCCACCA 


3540 




GAGGGCTCCT 


TGGTCTAGCG 


GTAACGCCAG 


CATCTGGGAA 


GACGCCCGTT 


GCCAAGTGGA 


3600 




CCGTGGTCTA 


GCGGTAGCCT 


CAGTGTCAAG 


GAAAAACACC 


CGCTACTTAG 


CAAACCAGGA 


3660 




AAGAGAGTCT 


CCCTTTCCCC 


GGGGGAGTTT 


AGAGAAGACT 


CTACTCCTCC 


ACCTCTTGCG 


3720 




GAGGGCCTGA 


CATCAGTCAG 


GCCCGCCCGC 


AGTTATCCGG 


AGGCCTAACC 


GTCTCCCTGT 


3780 


20 


GATGCTGTGC 


TTCAGTGGTC 


ACGCTCCTAG 


TCCGCCTTCA 


TGTTCCATCC 


TGTGCACCTG 


3840 




GCTCTGCCTT 


CTAGATAGCA 


GCAGCAAATT 


AGTGAAAGTA 


CTGAAAGTCT 


CTGATAAGCA 


3900 




GAAATAATGG 


CGTAAGCGGT 


CTCTCTCTCT 


CTCTCCTCTC 


TCTCTGCCTC 


AGCTGCCAGG 


3960 




AAGGGAAGGG 


CCCCCTGGCC 


AGTGGGCACG 


TGACCCACAT 


GACCTTACCT 


ATCACTGGAC 


4020 




ATGGTTCACA 


CTCCTTACCC 


TGCCGCTTTG 


TCTTGTATCC 


AATAAATAGC 


GCAACCTGGC 


4080 


25 


ATTCGGGGCC 


GCTACCAGTC 


TCCGCGTCTT 


GGTGGTAGTG 


GTCCCCCAGG 


CCCAGCTGTC 


4140 




TTTTTCTTTT 


ATCTTTGTCT 


TGTGTCTTTA 


TTTCTACACT 


CTCTCATCTC 


CGCATACGAG 


4200 




GAGAAAACCC 


ACCAACCCTG 


TGGGGCTGGT 


CCCTACACCC 


TGGCTTTGTA 


GACTGGAGCC 


4260 




TAGGCACGAC 


TCAGCTGCTG 


TAGTGAATTG 


CGATCCTCCA 


AACCCAGCAA 


GGCACCTGCA 


4320 




GGACATCTGG 


CCCAGTCTCC 


TCGTTGAGCC 


AGTTCACGAA 


AAAGAGACTT 


TTCTGAGTGA 


4380 


30 


CATGCTAATG 


GGCAATATGA 


GGACTAAATG 


GGATGGTCTC 


CAACTTGGAC 


AAACCAACAG 


4440 




TAAAAGCCAC 


TTTGCGGGGA 


AAGAAACTTT 


TCCTTTTTTC 


TTTTTTTTGA 


GACAGGATCT 


4500 




CACCCTGTCA 


CCCAGGCTGC 


AGTGCAGTGG 


CATGACCTTG 


GCTCACTGCA 


GCCTCAACCT 


4560 




CTCTCAGGCT 


CAAGCAATCC 


TCCCGCCTCA 


ACCTCCCATG 


CAGCTGGGAC 


CATAGGTGCA 


4620 




TGCCACCACA 


CCCAAATAAT 


TTTTATATTT 


TTTGTAGAGA 


CGAGGTTTCA 


CTATGTTGCT 


4680 
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CGGGCTGGTC 


TCAACTCCTG 


GGCTCAAGCA 


ACCCTCCCAC 


CTCAGCCTCC 


CAAAGTGCTC 


4740 


AGATTACAGG 


CAGGAGCCAC 


CAGGCCTGGC 


CAACATAGGA 


AGAAATTTAA 


ATTTGAATTG 


4800 


AATATTAGAA 


GAGATGAAAA 


TTCATCAACA 


TGGAAAGACA 


AAGATCATTA 


ACTAAAGCCA 


4860 


AACCAGAATG 


GAAGCTGTGT 


GTACAGTGGG 


GTCTCATGCT 


GGGAACGCGA 


GGGGCACGTG 


4920 


CAGGGCTCCA 


CGGTGTGGCG 


ACGCCCCATG 


CTCCCTTTGT 


GGGGGTTCAT 


CCAGCGGAAC 


4980 


ATGAGGACCT 


GGGGTGCTTT 


TCAACATGTA 


CGTGAGTTTA 


ATAATAAAAA 


GGTTTAAGGA 


5040 


AAGAAAAATT 


CATATGTTTC 


TATATAAACA 


GAACATCTGG 


AAAGATCTAT 


TCTAAGGTGT 


5100 


TGACAGTAGG 


AATCTCTAGG 


TAGTAGTAAT 


ATGGCCTTTT 


TGAATTTTTG 


CTTATCAGTA 


5160 


TTTTCTAATT 


TTCTTTTTCT 


TTCTAAATAA 


TTCTAGCTAT 


GAAATAATTT 


TCTACCATAT 


5220 


ATATTTTGTA 


ATAAAAATGG 


TTATATTTAA 


TTTTTTAAAG 


GCTGTACAAA 


CTTCCTGATA 


5280 


AAATGGCAAA 


TTAGACACAC 


ACATGTGGGC 


CGGGTACAGT 


GGCTCGCGCC 


TGTAATTCCA 


5340 


GCACTTTGGG 


AGGCTGAGGC 


AGGCAGATCA 


CCTAAGGTCA 


GGAGTTTGAG 


ACCAGCCTGG 


5400 


CCAACATGGT 


GAAACCCCGT 


CTCTACTAAA 


TATACAAAAA 


TGAGCTGGAT 


GTGGTGGCAC 


5460 


ACACCTATAG 


TGCCAGCTAC 


TTGGGAAGCT 


GAGGCAGGAA 


AATTGCTTCA 


ACCCGGGAGG 


5520 


CAGAGGTTGT 


AGTGAGCCGA 


GATCATGCCA 


CTGCACTCCA 


GCCTAGGCAA 


CAAGAGCGAG 


5580 


ACTCCAACTC 


AAAAAAAAAT 


AAAAATAACA 


CACACGTGAA 


TAGGCTCCTC 


ATGGAAGTCA 


5640 


TCACAACAAT 


GCAGAGGGAA 


GAGCTTCCAA 


AGTGTAAACC 


CAGAAGCGAG 


GAGCAGGAGG 


5700 


GTGCGCGCAG 


ACGCAGAGAG 


CAGCAAGGTG 


CAGACTGAGA 


GGCGGAGGCT 


GGCCGTGGGG 


5760 


AGATGACTGA 


TGCTCAGTTT 


ATACCCCAAA 


TCCGTAAATC 


TAGAGGCCTG 


GCACATCAAC 


5820 


TACCTCTGCC 


AGCAGGAATG 


AGGGAAAGGA 


GGGCAACCAA 


AAGATGTCCC 


ACCCTCACCC 


5880 


ATCCAGCTAC 


CTGCCATCCT 


CAGCCCCACT 


GGCAGAAGAC 


CCTGAGAGGT 


GGAGGCAGGC 


5940 


CCCTGCCTAC 


AGGACCCTGA 


GAGCTAGGGG 


AAGGCGTTAT 


CCTGAACTGT 


GTCCCCCGTA 


6000 


AAATTCATAT 


GTTGAAGGCC 


TCATCCCCAG 


TGTGACTGTA 


TTTAAAGATG 


GGGTCTTCAG 


6060 


GAGATAATTT 


AAATGAGGTC 


ATATAAGTTG 


GCCCTCATCC 


AGTAAGACTT 


TGACCTTCTG 


6120 


GTGGTTTTTT 


TTTTTTTGGA 


GACTGGGTCT 


CACTCTATCA 


CTCAGGTTGG 


AGTACAGTGG 


6180 


CACGATCACG 


GCTCACTGCT 


GTCTCCAACT 


CCTGGGCTCA 


GGTGATCCTC 


CTGCTTCAGC 


6240 


CTCCTGAGTA 


GCTGGGACTA 


CAGGTGCTTA 


CCACCGCACC 


CAGCTGGTGG 


TGCATTGTGT 


6300 


TTTTTGTAGA 


GATGGGGTTT 


TGCCATGTCG 


CCCAGGCTGG 


TCCTGAACTG 


GGCTCAAGTG 


6360 


ATCTGTCTCC 


CTCGGCCTCC 


TGCAGTGCTG 


GAATTACAGG 


TATGAGCCAC 


CGCGCCTGGC 


6420 


CGACCGTGAC 


CTTCTAAGAA 


GTGAAAGAGA 


AAGATCTTTC 


TCTCTCCCTC 


CCTCTCCATC 


6480 


ATGAGGACAC 


AGCAAGAAGT 


CGGCCATCTG 


CAAGGTAGAA 


AGCGAGTCCT 


CCCAACAGCT 


6540 


GAACCTGGCA 


GACCCTGATC 


TTGGACTTCA 


GCCTTCAGAG 


CTGTAAGAAA 


ATAACTCTCT 


6600 


GCTGTTCAGG 


CCACGCGGTC 


TACGGCAGCC 


CGAGCAGACT 


AAGACACACG 


CCATCTGGGG 


6660 


AGTCAGACCA 


GATCAGGAAG 


AAAGGCCTAG 


AGCTCAGGAT 


ACTGAAGGTC 


CCAACCCGGT 


6720 
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GCTGGACCAG ACCACCCCGG CAGCCGCGGC 
GGACACCATC CCGGCAGCCG CGGCCACGGA 
CATCCCGGCA GCCGCGGCCA CGGTGTCACG 
GGCAGCCGCG GCCACGGTGT CAGGGCTCAG 
5 ACCCCGAGGC TGCTTCCACC CAGACGCCTG 
GGATCAGAGC AGCCTCTCCC CAGGTCAGAA 
GGCACCATGA GAAAGCTACG TGGACTTCTC 
ATTGCAGGCC ATGTCAGGGC TCAGCTGAGG 
CACGGCGTCC CAAGGGAGGG ACTTGGGCAC 

10 GGGTGGGAGA TGTCTGGAAA CATCATGGAC 
CAAGGTGCTG ACAAAGGAAA AAGGAGAATG 
CTCAGAAACT CCTGCATTTC CAGACATGGC 
CAGGGTACCT CACACCAGGA ACCCACACAG 
GCCTCCACCA CCCTGACATG CAGGAGGGAG 

15 ACAGCAAGGG AAGAAAGGCA GGAAGGAACT 
CTGCACGCTG TCCACCAAGG GAGGTCTGAC 
TCGTCCTCTG TCAAACGGTG AGAAAGTCAA 
CATGAAGATG CCACACATGA GACACAGCAA 
CCAGAGCCCC AGAGCCCTCT GGGGAGGAAG 

20 GCTGACTCTG CCTTTATTTC ACCCCAGCAG 
GCAGCTCTGC CACCTCCTAG CAGTTCCACC 
GGCAAATGTT AGCGTTTTTG CCAACTAACA 
GAAGAAAACA CTCACAGTTT CATGAAGACT 
TTATGAAAGG CCAGCTCTCT GACATCCCTA 

25 TTTCAGGACG TGGAGCCACG TGGCCATGCA 
GGGTCTTCTG GTCCCCGTGC CATTTCTGCT 
GAGTCCAGTC TCAGAAGAAC TGGAACTAGA 
AATGCCAGCT GTGCCAAGGA CAGCCCAACC 
GCCCGAGCTT ACCCGTGAGG GGCCAAGTTG 

30 ACACCACTGC CGTCCACTGC AGTGTCCGGA 
ACACTCACGT CGGGAGTCGC CATGGCTCAG 
GTCCTCCAGG GTCCCAAGTC CTGGAGGAAG 
CACATCCACA CAACATAGCA CTCACAAAGG 
TGAGTCCCAG ACTGCTACCT CCTGACAAAA 



PCT/US00/33065 



CACGGAGTCA 


CGGCTCGGGT 


GAGGTGACCT 


6780 


GTCACGGCTC 


GGGTGAGGTG 


ACCTGGACAC 


6840 


GCTCGGATGA 


GATGACTCGG 


ACACCACCCC 


6900 


GTGAGGAGAG 


TTGGATATGG 


GACTGGGCCT 


6960 


GGTGGGTGAC 


ACGAAAGCTG 


GGCTCAGTTG 


7020 


ATGACCCTGG 


GCTCCTCACA 


GTAGCCCTAG 


7080 


TGACCAAGGG 


TCACTGCTGC 


CACACTACTC 


7140 


AGACGTGGAC 


ACCACCCCAG 


CAGCCGCGGC 


7200 


TGCCTCTCTG 


GGCAAGAGTG 


GGGAGGTGTG 


7260 


ACATGCCGGG 


AAAACACGGA 


AGCTGTGCAC 


7320 


GAGGTGTGAA 


CATCCAGCTA 


GCAGGTCCCA 


7380 


CACCAGCTCT 


GTGGATGAGA 


CAGGGGAGGA 


7440 


GTCCATGTCT 


TGCTCTGTGA 


TCACACAACA 


7500 


GTCAAAGCCT 


CGGGTCCAAC 


AACAGGCTCC 


7560 


CAGGGCCAGG 


TCCTCCCAGG 


CAGCAGCTGC 


7620 


CTACACCGCA 


CAGGGGTTGG 


CAGTCTAGAG 


7680 


AAGCTCATGC 


TCAGTGATAT 


GCTAGGTCAG 


7740 


GGATGAGACC 


AACGGGAAGA 


CTGCCCCAGA 


7800 


AATAAGGATG 


GCAGCCTGGG 


ACTGCCCGGG . 


7860 


AGGCAGGAGT 


GACACCGGCT 


CACAGCAGGA 


7920 


TACGGGCAGC 


AAAACAAAGC 


TGGCAGTTTG 


7980 


TTTGAATCGG 


ACATCTGGTA 


CAGAGATGAG 


8040 


GTCAAGAAAA 


TCACTGACTC 


TTCACTTCAT 


8100 


CCACTCCCTC 


TCACATGAGA 


AATCACGGCC 


8160 


GGTACGGGAG 


GCCTCCCCGC 


AGCTGCAGCT 


8220 


TTTCTTCGCT 


CTCTACTTAC 


ACACACATTT 


8280 


AAAATCCTGA 


CACTTGTCCC 


TTACTACGTT 


8340 


CAAGCCCCCA 


TCAGCCCCAA 


TGGCACCGAG 


8400 


GTCGTCACCA 


ACACGGTCTT 


CACCCCCTCC 


8460 


GTTGTCACAA 


CCACCACCTC 


CTCCATGTGC 


8520 


CGGAAGGGGA 


CGCCCAGGCC 


AGCAGCGTCA 


8580 


CAAGGCAGGG 


CACAGGGATG 


GAGTCATCTC 


8640 


CATCTCTAAT 


CAGCTCCAAA 


GACCCACCCT 


8700 


ACGAGCGGCA 


ACAGAAGGGC 


TACTCCAGGC 


8760 
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TCTGGTTCCG AGGGCGGTGT AAGCGCACTC 
CCCTTGGGTA GAAAGCACAG AGCCACTCCC 
AGGGGCCTGG AATTCCAAGC AACTTCCCTG 
GTCTCTCCTG GCCTGAACTC AAAGCCAGCC 
5 AAGCACTCCA AGAAGTCCTC GCTGGTCTGT 
AAAGAGAGGA GGGAAACGCC CTGGGTTTTT 
TCTCGCTCTG TCGCCCAGGC TGGAATGCAG 
CCTCCTGGGT TCAAGTGATT CTCCTGCCTC 
CACCATGTTG CCCAGGCTGG TCTCAAACTC 

10 TCCCAAAGTG CTGGGATTGC AGGTGTGAGG 
ATTATTATTA TTTTGTCTTT CCTGGCTTTG 
CGGTGATAAT GGCTGTCCTG GCCCAAAAAG 
AAAGGAATGG TGGCCCAAGA GTGTGGGGGC 
ACCTAGCACA GGGAACACAA AAGCCCCTGG 

15 CAGAGTGTGG GGGAGGCGAC AAGAGGAGAG 
GTGAGCGCCT GGGTCGTCCC AAGACAGTCT 
GAAGAAAACT GACCTACAGG ATAGGCCACT 
ATATGGGACA GATCCACAGT GCACTGGAAA 
GACACAGGAA CGGGGTGCAA AATTTGCAGC 

20 GAAAAAAAAA AGTCTACATT CTCTGTGAAA 
GTAGTCAAAG CATCCAGAAC ACGATCAAAC 
CAAAGGAACA AATACACAAG AGAAGCTGGC 
TGCTATCGGA CCAGCTCCAA GAGGAAAGCA 
TGGTATAGAA ACAGGAGTTA TAACCAAACA 

25 AGAAAATACA CAAAGCTGGG ATAGTCTCAG 
TGAGCCTAGG AGATTGAGGC TGCAATGAGC 
CAACAGAGTG AGAACTCTCT CAAAAAACGA 
GGGGCTGGGG GAGGGAGTAC AGGGAGTTGT 
ATGATGAAAA GCTCTGGAAA TGGACGGCGG 

30 GAATGGTGCT GAACCACACA TTTAAAAACA 
CCTGTAATCC CAGCACTTTG GGAGGCGGAT 
GGCCAACACA GTGAAATCCT GTCTTGACTA 
CAGGCACCTG TAGTCCCAGC TACTTGGGAG 
AGGCAGAGGT TGCAGTGAGC CGAGATCGTG 
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CACCCGTTTT 


TCCCACTGGA 


TAAGCCGAAA 


8820 


TCCACGTGGG 


GCTCAGAGCA 


GGAGGACAGG 


8880 


GACGCAGGCT 


CCCGGCTTGC 


CAGTTCTTCC 


8940 


CCAATCCCTG 


AACTGAGTTT 


CAGGTGCAGA 


9000 


GGAACGGGAA 


GGGAAACCCA 


TTCAAGACAG 


9060 


TTGGGTTTTT 


GGGTTTTTTT 


TGAGACGGAG 


9120 


TGGCACGACC 


TCGGCTCACT 


GCAAGCTCCA 


9180 


AGCCTCTCCA 


ATTGCTGGGA 


TTACAGGTTT 


9240 


CTGACCTCAG 


GTGATCCACT 


CACCTCGGCC 


9300 


CACCATGCCT 


GGCCTGCCCC 


GGGTTTAAAA 


9360 


CCTTCAGCAA 


GTCCAACCCC 


TGCTAAAACC 


9420 


CTTGGAGACA 


GGGGAATCTT 


CCTCCTGACT 


9480 


TCCCTGTTGC 


CCTCTCACTC 


TCCATCCCCT 


9540 


TTTCCAGCCA 


GAGGGCAACG 


AGCCTGGAGT 


9600 


GGGAGAAGAG 


AGGATGGCAC 


ACAGCTGTGT 


9660 


CTACGTGGTC 


CTGACCCTAA 


AGGGCAAAGG 


9720 


GCCCAGGTCT 


CAGATGGGCC 


CCAGTGGCGC 


9780 


GTCTCTAAAA 


TAAACTGGCC 


TAAGAACACA 


9840 


CTGAACCTAA 


CCAGGTCGAT 


TTCTTGCTAT 


9900 


CTTAAAACAA 


GACCTAGAGT 


CCATAGCACA 


9960 


TTCCTGGCAA 


AGGGTAGTCT 


GGTTGATTCT 


10020 


TCTTGAACGC 


AGAATCCAGA 


GACTTTCAGG 


10080 


AACATTGTCA 


ACCAAGTGGA 


AAGAAAATCT 


10140 


GAAATGTGAA 


AATTAAAAAC 


GACAACCAAA 


10200 


CTACTCGGAA 


GGCGGGGCTG 


GAGGATCGTT 


10260 


TGTGATCACA 


CCACCGCACT 


CCAGTCTGGG 


10320 


AAAAGAAAGA 


AAGTAGAACA 


GAAGTGACCA 


10380 


TCTTTAATGA 


GTACAGAATT 


TCTGTTTGGG 


10440 


TGATGGCTGC 


ACAATCACTG 


TGGCTGTTCT 


10500 


GTTAAAATGG 


GCTGGGCGTG 


GTGGCTCACG 


10560 


CGCCTGAGGT 


CAGGAGTTCG 


AGACCATCCT 


10620 


AAAATACTAA 


AAATTAGCCA 


GGCATGGTGG 


10680 


GCTGGGGCAG 


GAGACCTGCT 


TGAACCCAGG 


10740 


CCACTGCACT 


CCAGCCTGGG 


CAACAAGAGC 


10800 
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GAAACTCCAT 


CTCAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAGTTTAAAA 


TGGTTAAATT 


10860 




TTATGTTATG 


TATATTTTAC 


CGTAATAAAA 


ACACTGTAAT 


GCTACTATAA 


TAGAATGACT 


10920 




CATTAGGATT 


AGATATAGAC 


TAGAAAGTAC 


AGAATATAAA 


AACTTTTTAA 


ACAAAGAAAA 


10980 




ATTTTCATGG 


CCAGGCATGG 


TGTCACACCT 


GTAATCCCAG 


GACTTTGGGA 


GGCCAAGGCA 


11040 


5 


AGAGGAATGC 


TTGAGCTCAG 


GGGTTTGAGA 


CCAGCCTGGG 


CAACACAGCA 


ACACCCCATC 


11100 




TCTGCTAAAT 


AAATAATAAA 


AAATAGCCAG 


GCATGGTGGT 


GTGCACGCCT 


GTAGTTGCAG 


11160 




CTACTCTGGA 


GGCTGAGGCA 


GGAGGATCAC 


TTAAGCCCAG 


GAGGTCAAGG 


CTGCAGTGAG 


11220 




CCATGGTTGT 


GCCACTGCGC 


TCCAGCCTGG 


GCAACAGATC 


AAGACCTTGT 


CACAAAAAAA 


11280 




AGAAAGAAAG 


AAAAGAAAAA 


AGAAAGAAAA 


TAAAATCTTC 


CAGAACTTTT 


AAAATCATCA 


11340 


10 


TTGTTAATAT 


AAAAATAACA 


TCACCTGCCC 


CTAGGACTGT 


AACAAACAAG 


TGTGTCTAAG 


11400 




GACAGGAGTG 


GGTCCACCCC 


AACCTGGCAC 


GCAGTGGTCC 


CCTGCGGAGA 


GTCTGGCCCT 


11460 




GCACTCACTA 


AGAGGAGGCA 


CTCATAGCCC 


AGCCAGGCCT 


CTGCAATTAT 


GCCTTCAATG 


11520 




CCAGAACTAA 


CTCACCCAAA 


CTGAACAATC 


GATCACAAAA 


TGTGCCTTCA 


GGTCTCAAGG 


11580 




TTCTTGCTAA 


ATCTTACTCA 


ACCGACATTT 


TCCAGCATGG 


GAACATTTTT 


CTGAATGTCT 


11640 


15 


TAGGGAGAGG 


AAGTCCGCAA 


GAGAACAAAA 


GGTCCTCAGG 


CCACCCTAGC 


TTCTTTTCCT 


11700 




CCATTCCACA 


GGCTGTCTTT 


TGTCTGGGTA 


TGCACTGGAC 


CAGGGGGCTC 


TACTTCTTCC 


11760 




TACCTGGGCA 


TGGGTCTCCA 


CACAACTCCA 


AGGTAAAGGG 


CCACAGGCAA 


GATAAAGGGG 


11820 




AGAAAAGAAA 


GCTACGATTT 


CCTGGGCCAC 


CAATCGCAAA 


TGGCAGCCAG 


TCTCTGAAGT 


11880 




AACCCTTGAC 


CAGAGATCCA 


AGGAACCAAG 


AAATGTAGGT 


GATCTGAACA 


GAGGGGATGG 


11940 


20 


TGGTTAAACA 


CCATGAAGGA 


AAGACCCATT 


CTCAAAGAAA 


AGGAAGCAAA 


AAGAAACCGT 


12000 




GGGGAGCTGG 


GtACCACCCG 


CAGCAAAGAC 


CCCGCACGCG 


TTACTGACGC 


CAGCCTGGCC 


12060 




TGGGAGAGCA 


GTGAGTGTGG 


CGGACGGTGA 


GTGGCGGGGA 


GGGCTGTGGT 


AGGTTTAGGG 


12120 




TAAGAAGGGG 


CAGCGCCCAG 


AGCCCAGAGA 


ACACCAGTGA 


GGGCTCCACA 


GGAACACTAC 


12180 




TCAAAGTATT 


CACGGAACAC 


ATCTAAACAC 


AAGCACTAAG 


GACTAAGTGC 


GAGGGACAAG 


12240 


25 


AAAATATTCC 


CCGTTTCCTG 


TTTCAGGAGG 


GTATCGAAAA 


TGAGTGATGG 


AAGGAAAATG 


12300 




TATTGTTTAA 


ATGAGGAAAA 


AAAATTTTTA 


CAAATTAAGA 


ACATCCTGGA 


ACATGATGAG 


12360 




CCGTTTACTG 


TCACTCAATT 


TAAATGGTGG 


CCATCTAGGA 


CAGAGCGCCT 


AAGGGGAAAG 


12420 




GGGGCTCACA 


GGTGAACCCC 


TCCAGCTGCT 


GGTGGGCAAT 


TTCCCATTAG 


GGCATCAGGG 


12480 




TCTCTGAAGA 


CTGTCTTCAG 


ATGCTTTTTA 


GCCAGGAAAG 


TTACAATGAT 


GAATTCGTTT 


12540 


30 


ACACTGGCGG 


AATTACTTCG 


TATTTCTCAA 


ATATAATGTT 


TTCACTAGCA 


TAACTTTGTT 


12600 




GTTGTAGACT 


TAGGCTTCAA 


AATAAAGAAC 


TTTAAACAAA 


CATGAATAAA 


AAGCCACTTT 


12660 




AGGCCGGGCG 


CGGTGGCTCA 


CACTTGTAAT 


CCCAGCACTT 


TGGGAGGCCG 


CGGCGGGTGG 


12720 




ATCATAAGGT 


CAGAAGTTCA 


AAGACCAGCC 


TGATCAATAC 


GGTGAAACCC 


CGTCTCTACT 


12780 




AAAAATACAA 


AAATTAGCCG 


GGCGCGGTGG 


CAGGTGCCTG 


TAATCTCAGC 


TACTTGGGAG 


12840 
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GCTGAGGCAG 


GAGAATCGCT 


TGAACCTGGG 


CAGCAGAGGT 


TGCAGTGAGC 


CAAGATCATG 


12900 




CCACTGCACT 


CAAGCCTGGG 


TGACAGAGTG 


AGACTCTCTC 


TTAAAAAAAA 


AAAGCCACTT 


12960 




TAAAATTTTA 


CTCAGGCCAG 


GTGTGGTGGC 


TCACGCCCAT 


AATCCTAGCA 


CTTTGGGAGG 


13020 




CCGAGGCGAG 


CAGATCACCT 


GAGGTCAGGA 


GTTAGACCAG 


CCTGGCCAAC 


ATGGTAAAAC 


13080 


5 


CTTGTCTCTA 


CTGAAAACAC 


AAAAATTAGC 


TGGGCGTGGT 


GGTGTGCCCA 


TGTAATCCCA 


13140 




GCTACTCAGG 


AGGCTGAAGT 


GAGAGAACTG 


CTTGAACCCG 


GGAGGCAGAG 


GCTGCAGTGT 


13200 




GCCAAGACTG 


CACCACTACA 


CTTCAGCCTG 


GGCGACAGAG 


CAAGACCCTG 


TCTCAGAAAA 


13260 




AAAAAAAATT 


CAAAAATTTG 


GCCAGGCGTG 


GTGGCTCACG 


CCTGTAATCC 


CATCACTTTG 


13320 




GAAGGCCGAG 


GCGGGTGGAT 


CACCTGAGGT 


CAGGAATTCA 


AGACCAGCCT 


GGCCACCATG 


13380 


10 


ATGAAACCCT 


GTCTCTACTA 


AAAATACAAA 


AAAAAAAAAA 


CAAATTGGCC 


GGGCATGGTG 


13440 




GCGGGTGCCT 


GTAATCCCAC 


CTACTTGGGA 


GGCTGAGGCA 


GGAGAATCTC 


TCGAACTCCG 


13500 




GAGGCAGAGG 


TTGCAGCGAG 


CCAAGATTGT 


GCCACTGCAC 


TCCAGCCTAG 


ACAACAGAGC 


13560 




GAGACTCTGT 


CTCAAAAAAA 


AAAAAATTAA 


AATTAAAAAA 


TAAAAATTTC 


ATTTAAAATA 


13620 




CTACTGATCT 


CCCGTGCTGA 


CTTCTCGGGG 


TTTAACTCTC 


ACTGAGGAGA 


CGCTGCTTTC 


13680 


15 


ATAAGGGTAA 


GCTCAGCAGG 


GGCAACTAAA 


GTCATTTAAG 


CAGAGAGCTG 


CAAAGAGGCA 


13740 




ACAGCCTCAC 


TGCAGGCAGG 


GGTCCTCGTC 


ACAGCTTCAG 


GGCTTTGCAG 


AGGATTACGC 


13800 




AATGTACACG 


CACAAAACTG 


AATTCCAGCC 


TCTCCATTGG 


CAACTGCATA 


CATACATATA 


13860 




TTCTTTTTTT 


GAGACGGAGT 


CTCGCTCTGT 


AGCCCAGGTT 


GGACTGCAGT 


GGCCCGATCT 


13920 




CGGCTCAATG 


CAAGCTCTGC 


CTCCCGGGTT 


CAAGCGATTC 


TCTTGCCTCA 


GCCTCCTGAG 


13980 


20 


TAGCTGGGAT 


TACAGGCGCC 


CACCACCACG 


CCCGGCTAAT 


TTTTGTATTT 


TTAGTAGAGA 


14040 




CGGGGTTTCA 


CCATGTTGGC 


CAGGACAGTC 


TCGATCTCCT 


GACCTCGTGA 


TCCGCCCGCC 


14100 




TCTGCCTCCC 


AAAGTGCTGG 


GATTACAGGC 


GTGAGCCACT 


GAGCCTGGCC 


TCCAATGGCA 


14160 




ACTATATTAA 


AGGTTCAAAG 


CAATATGCAC 


AAAAGTTACC 


TCACAGAAAA 


TAGTGCAAGT 


14220 




CCTTGATACA 


ATGCTCTTTA 


GACACAGAAG 


AAGCACTATA 


GAATAGAGCA 


CCTCGCCCTA 


14280 


25 


TTGCCTTCCC 


AAGGGCGAGC 


ACCCCCTCCT 


CTCTCCACAG 


CTCCTTCTTT 


GTTTTTTTGA 


14340 




GATGGAGTCT 


CGCTCTGTCA 


CCCAGGCTGG 


AGTGCAATGG 


CAAAATCTTG 


GCTCACTGCA 


14400 




ACCTCCGCCT 


CCCGGGTTGA 


AGTGATTCTC 


CTGCCTCAGC 


CTCCCGAGTA 


GCTGGGACTA 


14460 




CAGGCACCCA 


ACACGCCTAG 


CTAATTTTTG 


CATTTTTGGT 


AGAGACGGGG 


TTTCATCATG 


14520 




TTGGCCAGGC 


TGGTCTCGAA 


CTCCTGACCT 


CCAGTGATCC 


TCCCACCTTG 


ACCTCCCATA 


14580 


30 


GTGCTGGGAT 


TATAGGTGTG 


AGCCACTACA 


CCTGGCCTCT 


CCACAGCCCC 


TTCTGTGTTG 


14640 




AAGCCAAGAC 


CCACCCAGCT 


TTGATCCCAA 


GGCTTGGGTT 


CCCCACTAGT 


GTGAAGTGAG 


14700 




TTTCCAAATT 


ATTAGGTAAA 


TCAGATATGA 


GAAAATATTT 


TATTTTACTT 


TTTTTTTTTT 


14760 




GAGACGCAAT 


CTTGCTCCGT 


CACCCAGGCT 


GGAGTGCAAT 


GGCACCATCT 


CCACTCACTG 


14820 




CAACCTCTGC 


CTTCTGGGTT 


CAAGCAATTC 


TCCTGCCTCA 


GCCTCCCAAC 


TAGCTGGGAT 


14880 
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TACAAGTGCA CACCACCACG 
CCGTGTTAGC CAGGCTGCTC 
AAAGTGGTGG GATTACAGGT 
AGTATTCTTG ACCGGCACGG 
5 AGCAAAGAAA GAGAATTCTG 
GAATTACACA GTAACCAAAT 
AGCTCCGACC AACAGGGTCC 
AGCCTACAGT GACGCGGGCC 
GTCCCAGTGG GGTGATCCCT 

10 CCCTGCGGGT CGGGTGCCTG 
CCTGCGGGGT GGCCCCTGGG 
TGGGGATCGC GTCCCTGCGG 
GCGGGTCGGG TGCCTGCGGG 
GGGTCGCGTC CCTGTGGGGT 

15 CCTGCGGGGT GGTCCCTGTG 
TGCGGGTCGC ACCCCTGCGG 
AGGCCGAGCC CGCGCCCGCC 
CCCGCCACCG CCCGCGCCGC 
CGGCCGCCCC TCCCCCGGCC 

20 ACCCCTGACT GGACGGCGGC 
GGCTCGCCGG CGCCGCTTAC 
GGCGGCGTCG GGAGCTGCGG 
GTTGGGGATT CGGCGGCGGC 
GGCCGCGCAC GCACGGGGCT 

25 GGCGGGGCCG TGAAATAAGG 
AGCTCCTAAC GCCGCAGGTT 
CCGCGCGGGC GGCCGGGCCG 
GCCCACGCGC GGCAGGGCGG 
GCTGCTGTGT GTGAATGGGC 

30 GCTGTGGAGG AGGACGCTGT 
CAGCCACGGC GCTCCCAGCG 
GTCACCCCTC AAGCTGTCAC 
ACAGCCGACA CCCCGATCCC 
TAACACCCCG GTCCCACCGC 



CCCGGCTAAC TTTTGTATTT 
TCAAACTCCT GACCTCATGA 
GTGAGCCATC ACACCTGGCC 
TCAACACTGA TGTAATTGAA 
GTTCAACAGA AAAGTCAGTC 
AGATAACATG CCATGACTGA 
ACACCACCAT GGGTCTACAC 
CCTGTGGGGT GGTCCCTGCA 
GCGGGTCGCG TCCCTGCGAG 
CGGGGTGGTC CCTATGGGTC 
AATCGCGTCC CTGCGGGTCG 
GTCGGGTGCC TGCGGGGTGG 
GTGGTCCTTG TGGGTCGCGT 
GGCCCCTGCG GGTCGCGTGG 
GGTCGCGTCC CTGCGGGTCG 
CGTGGTCCCC CCGGGATGGG 
CGCGGCGCCA AGATGGAGGC 
CCGCCTGACG CCGCCGTTGC 
CTCCCCTCCC CCCGCCGTAA 
GCGTGAGCGG AGCGAGAGGC 
CTGGGGCCGC GCCGGGCCTG 
CGGCGGCGGG CGGCGGCGGC 
GGCGGCGCGG GCGCGCGCTT 
GGGAGGGCCG GACACTTATT 
CCCGACGGGC CCCGGGGCGC 
CCTCCTGGTC CCCGAGGCCC 
AGGGACGATG GTCAGTGGAC 
TACCTTCAGG CCTCCAGGTA 
GCGAGGGGAC TCCCCTGCGG 
AGGGTGCGCG GACTCACGCG 
TGGAGGGCGA GGGGCATCCG 
CCCAGTCCCA CAACCAGCAC 
ACCCCTGCCC AACAGCCGGC 
TGTCCCACGG CCGGCACCCC 



TTAGTAGAGA 


CAGGGTTTCA 


14940 


TCCGCCCACG 


TCGGGCTCCC 


15000 


CAAGAAAATA 


TTTTTAAACT 


15060 


ACTGTTGTAT 


TTGAAGTGTT 


15120 


ACGACTTTTC 


AGTCACGCAT 


15180 


CGACGGGCCC 


ACAACAAATC 


.15240 


AGATCCAGGT 


CCCGCCTGTG 


15300 


GGTCAGGTCC 


CTGAGAGTGG 


15360 


TTGGGTGCCT 


GCCGGGTGGC 


15420 


GCGTCCCTGC 


GGGTCGGGTG 


15480 


GGTGCCTGCG 


GGGTGGCCCC 


15540 


CCCCTGGGGA 


TCGCGTCCCT 


15600 


CCCTGTGGGG 


TGGTCCCTGT 


15660 


TGGCCCCTGC 


GGGTCGGGTG 


15720 


GGTGCCTGCG 


GGGTGGTCCC 


15780 


TCCACCGAGG 


AGGCCGCTGG 


15840 


AGGAAGCGCC 


GCCGCCCGCG 


15900 


GCCTGACGCC 


GCCGCCCGCG 


15960 


CGTCCTGACG 


CTCCGCAGGG 


16020 


CTCGCCGCGG 


GGGGGCCGCG 


16080 


CTTAGGCACC 


CGGCGGGGGC 


16140 


GGCCGCGGGC 


TTCGCTCCTT 


16200 


CCTAGTGACG 


CAGGCGGCGG 


16260 


TGGCGCTCGC 


GGAGGAGGAA 


16320 


GTGCGCGGAC 


CGACACTGTC 


16380 


CCGGTCGGGC 


GTTGCCTGCC 


16440 


GGACGGCGCC 


AGGGAGCAGT 


16500 


CGGGCGCTCC 


TCGCCCGGAC 


16560 


GGCGGACGCC 


TGAACACGAG 


16620 


GAACATGCCA 


GAGGCTCAGC 


16680 


GGAGCGGCCG 


GGAGGGCTCG 


16740 


CCCGATCCTA 


TCGCAGTCCC 


16800 


ACCCACCCCA 


ATCCCATAGC 


16860 


GATCCCACCC 


CAGTCCCGCA 


16920 



-29- 



WO 01/42434 



PCT/US00/33065 



GCTGGCACCC CGATCCCACC CCAGCCCAAC 
CCCACAGCCG GCACCCCGAT CCCACCCCAG 
GGCACTCACC CCGATCGCAT AGCATAGCTG 
AGCACCCCGA TCCCACCCCA GTCCCATAGC 

5 ATCACGCCCC AGTCCTATAG CCCGCACCCC 
CATCCCACCC ATGTCCCACA GTCGGCACCC 
GATCCTGTGG CCCTCCTCCA GCCCCCAGGG 
GGGCTGGCTC TGTTGGCCTC TGTGCTGGGT 
CTGACCTCTC CAGGTGTCAT CGAACACCCT 

10 AGGATCCCGG ATTCCGTATC AGGGGACCGA 
GCTTGGATGG AACCCTGACC CTGTGCTCAC 
TCCCGGAACG GGAGGTTCCC AAGCCACTGC 
ATTCATTCAG TTCCCAGTAA AGTAGAACGT 
AAAATTAGAT TGGCCGAGTG CGGTGGCTCA 

15 AGGTGGGAGG ATCACCTGTG GCCAGGAGTT 
CTGGGGAAGA ATAAAAAAAA AAATTGAGCC 
AAATATGAGA GGGTTCACGC GGTCTATGTG 
GTTTTGGAAA TAAATGTCAA TAAAGTCGAA 
AGGGTGGCTA CATCATGCCT GTGTGTTGCG 

20 CCTGGGGCCT GTTCTGCCTC CTTCACTCTG 
GAGAAACTCC ATCTAAAAAA AAAGAAAAAT 
TGGCATAACA AATTATCTCA AACTTAGTCG 
AGGCTCACGC CCATAATCCT AGCACTTTGG 
AGTTCGAGAC CAACATGGCC AGAACTGTCT 

25 GGTGGCTCAC GCCTATAATC CCAGCACTTT 
AGGAGATAGA GACCATCCTG GCTAACACTG 
AATTAGCCTG GCGTGGTGGC AGGCGCCTGT 
AGAATGGCGT GAACCCAGGA AGCGGAGCTT 
CAGCCTAGGC AACAGAGCAA GACTCCGTCT 

30 ATTCTCTAAC TGTTCTGTAG GTCGGAAGTC 
TGTCTGCAGG GCCGATTGCT TTTGGAGCTC 
TTCTGGAAGC ACTTGAGCCC CTTGTCTCGT 
GCCAGTTGAG TCCTCACCCT GTTGGCCCCG 
TCAAGGCCCC TTGTGATGAC ATGGGGCCAC 



AGCTGGCACC CACCCCGATC CCACCGCTGT 16980 

TCCCGCAGCC GGCACCCCGA TCCCACAGCC 1704O 

ATACCCCGAT CCCACCCCAG TCCCATAGCC 17100 

CAGCACCTCG ATCCCATAGA TGACACCCCG 17160 

GATCCCACCC GAGTCCCGCA GCCGGCACCC 17220 

CGATCCCACT CGGATCCGGC AGCCAGCTTG 17280 

CTCATTTATA TGTTTTATTG GCAGAGGCTG 17340 

I 

TTCTTCCTCT GCACCGCAGG ACTGGCTCTC 17400 

TGTGCTTGCT GTCACCCGCT GCCTGTCTGC 17460 

AATTAGTCGG AAAATAGGAA GCAGGTGCTC 17520 

ACTTGTAGGA GGAGGGCTCT GCAGGCCGCC 17580 

ACTTCGGAGG GGCTGTAATT AGAGTTGCAC 17640 

GCTCCAGCCA GTGAGGAAAA GGTGTTTTTA 17700 

TGCCTTTTAC CTCAACACTT TGGGAGACAA 17760 

CAAGACCAGC CTGGGCAACA GAGCCTGTCT 17820 

TTTGTCAGTG CTACTATTTT ATTATCTGGT 17880 

TGTCATTTAT CTGAGTTTGC CTATCGTCAC 17940 

GAGGAGTGCT GAGGGGGGCC TGGGGATGGG 18000 

CAAGCCCACC GAGGTCGGCC TGGGGTGAGC 18060 

GGGCTCCAAG AGACAAACTG GGCAACAAGA 18120 

CACCTCCAAG ATAACTTAGC TTTCTTCTGC 18180 

CTTAAAAATG CAAATTTAGG CTGAGTGCGG 18240 

GAGGCCAAGG CAGGATTGCT TGAGGCCAGG 18300 

CTTTTTAAAA AATGCAAATG TGTCCGGCAC 18360 

GTGAGGCCAA GGCGGGCAGA TCACGAGGTC 18420 

TGAAACCCCC TCTCTACTAA AAATACAAAA 18480 

AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 18540 

GCAGTGAGCC GAGATGGCGC CACTGCACTC 18600 

CAAAAAATAA ATAAATAAAA CTGCAAATGT 18660 

CAGCCCAGCC TCACTCCGCC AAAATCAGGG 18720 

CAGGGGAGAA GCTGTTCTGG CCTTTCCAGT 18780 

GGCCTATCCC ACACCTGAAA GCCAGCCAAA 18840 

ACACTGATCT CCTGCCTCCC TCATCTGCTG 18900 

CAGCTGGCCC AGGGCACCTC CTGTCAGAGT 18960 
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CCGCCGACCA GTGACCTTCA 
GTTCACAGAT CCCAGGGGTT 
CTTCCTTCTA AAATATTTAT 
AGACAGAGTC TCGCTCTGTC 
5 AACCTCTGCC TCCGGGCAGA 
TTTGTTTTGT TTTTGAGATG 
GCACTTTGGG AGGCCGAGGC 
AACACGGTGA AACCCCGTCT 
CGCCTGTAGT CCCAGCTACT 

10 GGAGCTTGCA GTGAGCCGAG 
TCCGTCTCAA AAAAAAAAAA 
CCAGGCTGGA GTGTAGTGGC 
TATTTTTAGT AGAGACAGGG 
CAGGTAATCC ACCCGCCTCA 

15 TCTGGCCATA TTTATTAACT 
CTCACTCTGT CACCCAGGCT 
CTCCCAGGTT CAAGCGATTC 
CGCTACCAAG CCCAGCTAAT 
GAGGCTGGTC TCGAACTCCT 

20 GATTATAGGC ATGAGCCACT 
GGGTTGGCTG GGCATGGTGG 
GAGGATCGCT TGAGTCCAGG 
CCGCAAAAGC TAAAAAGTTA 
GGAGGCTGAC GTGGGAGGAT 

25 TGTGCCACTG CACTCAGCCT 
GTCCAGGCAC AGTGGCTCAC 
TTCGTTGAGG TCAGGAGTTC 
AAAATACAAA AAAATTAGCC 
GACTGAGGCA GGAGAATCAT 

30 GCCACTGCAC TTCAGCCTGG 
TTGAGTCCTG AGTTCCTCTA 
GAACAATCTG ATGAGGAAGA 
AGTGGCTCAC GCCTGTAATC 
AAGAGATCAA GACCATCCTG 



TTCCATCTGT CGCTGTAATT 
AGGATGTGAA TATCTTGGGC 
CATTTTTGTT TTGGGGATTT 
GCCCAGGTTG GAGTGCAATG 
CGTGAGCCAC TGCACCAGGC 
GAGTCTCGGC CGGGCGCGGT 
GGGCGGATCA CGAGGTCAGG 
CTACTAAAAA TACAAAAAAT 
CGGGAGGCTG AGGCAGGAGA 
ATCGCGCCAC TGCACTCCAG 
AAAAAAAAAA AAAAAAAGAG 
GGGATTATAG GTACGCGCCA 
TTTTACCATG TTGGTCAGAC 
GCCTCCCAAA GTGCTGGGAT 
ACAAAGGGAA AGATGATAAT 
GGAGTACAAT AGCGTGATCT 
TCCTGCCTCA GCCTCCCAAC 
TTTTGTATTT TTAGTAGAAA 
GACCTTGTGA TCTGCCCACC 
GCAACCGGCT GAAAGATGGT 
CTTATGCCTG TAAGCTCAGC 
AGTTTGAGAC CAGCCTGGAC 
GCCAGGTGTG GCGGCACATG 
CACTTGAGAC CAGGAGGTCA 
GGGCAACAGA GCGAGAGTCT 
ACCTGTAATC TCAGCACTTT 
AAAACGAGCC TGGCTAAATG 
AGGCATGGTG ACGGGCGCCT 
GTAAACCCAG GAGGCTGAGG 
GCGACAGAGC AAGACTGTCT 
AGGGAAATTC CAGGCACCTC 
AGATAGAAAC AGCTTAAACA 
TGAGCACTTT GGGAGGCCGA 
GCTAACATGG TGAAACCCCG 



CCCCTTTGCT 


TGGAACCAAC 


19020 


AGGGCTGTGG 


GGGGGCTATT 


19080 


TTTTGGTTTG 


GTTTTTTTTG 


19140 


GTGCAATCTC 


AGCTCACTGC 


19200 


CTGTTTTTGT 


TTTTGTTTGT 


19260 


GGCTCACGCC 


TGTAATCCCA 


19320 


AGATCGAGAC 


CATCCTGGCT 


19380 


TAGCCGGGCG 


TGGTAGCGGG 


19440 


ATGGCGTGAA 


CCCGGGAGGC 


19500 


CCTGGGCGAC 


AGAGCGAGAC 


19560 


ATGGAGTCTC 


ACTTTGTCAC 


19620 


TCATGCCCAG 


TTACTTTTTG 


19680 


TGGTCTCAAA 


CTCCTGATCT 


19740 


TACAGACGTG 


AGCCACCGTG 


19800 




GAGATGGAGT 


19860 


TGGCTCACTG 


AAACCTCTGC 


19920 


TAGCTGGGAT 


TACAGGCGCA 


19980 


CGGAGTTTCA 


CCATGTTGGT 


20040 


TCGGCCTCCC 


AAAGTGCTGG 


20100 


AATTTTAAAG 


TAGAGAAACT 


20160 


ACTTTGGAAG 


TCCAAGGCAA 


20220 


AATATAGCAA 


GACCCCATCT 


20280 


CCTGTAGTCC 


CAGCTACTCA 


20340 


AGGCTGAAGT 


GAGCTGTTAT 


20400 


GTCTCCAAAG 


GTAAAAAAAG 


20460 


GGGAGGCCGA 


GGCGGGCAGA 


20520 


GTGAAACCCC 


GTCTCTACTA 


20580 


GTAATCTCAG 


CTACTTGGGA 


20640 


TTGCAGCGAG 


CCAAGATCAT 


20700 


CAAAACAAAA 


CAAAAGAATC 


20760 


GCCACCCTTG 


ACAGGCAAAG 


20820 


ATAGTCTCCC 


GGCCGGGGGC 


20880 


GGCGGGTGGA 


TCACAAGGTC 


20940 


TCTCTACTAA 


AAATACAAAA 


21000 
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AATTAGCCGG 


GCGTGGTGGT 


GGGTGCCTGT 


AGTCCCAGCT 


ACTCGGGAGG 


CTGAGGCAGG 


21060 


AGAATGGCGT 


GAACCCAGGA 


GGCGGAGCTT 


TCAGTGAGCT 


GAGATCGCGC 


CTCTGCACTC 


21120 


CAGCCTGGGC 


GACAGAGCCT 


CGAGACTCCA 


TCTCAAAAAA 


AAAAAAAAAT 


TAGCTGGGTG 


21180 


TGGTGGCTCA 


CACCTGTAAT 


CCCAGCTACG 


TGGCAGGCTG 


AGGCAGGAGA 


ATCGCTTGAA 


21240 


CCTGGGAGGC 


GGAGGTTGTA 


GGGAGCTGAG 


ATCGCACCAC 


TGCACTCCAG 


CCTGGGCAAC 


21300 


AGAGCGAGAC 


TCTGTCTCAA 


AAAAAAAAAA 


AAAAAACAAA 


AAAACAATAG 


TCTCCCAAGT 


21360 


AAGTCAGAGT 


CACAAGGTGT 


TTTGATTCCC 


TGTGGAAACT 


AAAATATAAC 


AGCTTAACAT 


21420 


ATGTTCTTGA 


GTTATTTTTC 


AGAAACTTGG 


ACATCCACCA 


GGTGGAAAAT 


GCTGAGCTAG 


21480 


GAACAGTGGC 


TATAATTTCA 


GCCTTTTGAG 


AGGCCAAGGT 


GGAAGGATCA 


CTTGAGGCCA 


21540 


GGAGTTAGAG 


ACCAGCCTGG 


CCAACATGGT 


GAAACCCCGT 


CTCTAGTAAA 


AATACAAATA 


21600 


TTAGCTGGGC 


ATGGTGGTGC 


AACCTGAAAT 


CCCAGCTACT 


TGGGAGACCT 


AGCTGGGAGG 


21660 


ATCGCTTGAA 


CCTGGTAGGA 


GGAGTTTGCA 


GTGAGCTGAA 


ATTGTGCCAC 


TGCACTCTAG 


21720 


CCTGGGCAAC 


AGAGTGAGAC 


TCTGTCTCAA 


AAAATAAATA 


AATAAAAAGA 


GAAAAAAGTG 


21780 


TTGCCTGCAG 


GCCGGGCACA 


GTGGCTCACG 


CCTGTAATCC 


CAACACTTTG 


GGAGGCCGAG 


21840 


ATGGGCAGAT 


CACCTGAGGT 


CAGGAGTGCA 


AGAACAGCCT 


GGCCAACATG 


GTGAAACCCC 


21900 


ATCTCTACTA 


AAAATACAAA 


AGTTAGCTGG 


GTGTGTACAT 


GTAGTCTCAG 


CTACTTGGGA 


21960 


AGCTGAGGCA 


GGAGAATCTC 


TTCAACCGGG 


GAGGTGGAGG 


TTGCGATGAG 


CTGAGATCAC 


22020 


GCCACCACAC 


TCCATCCAGC 


CTGGGTGACA 


GAGTGAGACT 


CCATCTCAAA 


GCAAAAAAAG 


22080 


AAACATAGGT 


GGGACCCTTG 


GTGTGTCCTT 


AGGGCATGAT 


GGTTGAGGTA 


TACTGCTGGT 


22140 


CCTGTCATGT 


AAAAGAAAAC 


GAGCCGACTC 


TGTGTCTACT 


GGAGAAAGCA 


CTGCATATAT 


22200 


CAGCCACAGT 


CAATACCTCG 


CTTCTGCAGG 


GACGGTGGCT 


GCCAGAGTGG 


GAGGCTTTGG 


22260 


TAGCACCCAT 


GTCGTGGAAT 


CACAATGTTG 


TCGATAGCTC 


TGGGGTCTTG 


TACAAAATGC 


22320 


CAGATCCTCC 


CATTTGGTTT 


CCTTATGGGA 


AGGATCGCAG 


TACTATAATA 


CATGGGCTTG 


22380 


TGCAAGGGAT 


CATTATACCC 


TTTTCTCTTT 


TTTTGCTTTT 


CTTTGAGACA 


GAGTTTCACT 


22440 


CTCGTCACCC 


AGGCTGGAGT 


GCAATGGCGC 


GATCTTGGCT 


CACTGCAACC 


TCCACCTCCT 


22500 


GGGTTCAAGT 


GATTTTCCTG 


GCTCAGCCTT 


CTGAGTAGCT 


GGGATTACAC 


ATGCCCGCCA 


22560 


CCAGGCCTGA 


CTTATTTTTG 


TATTTTTAGT 


AGAGACAGGG 


TTTCACCAAG 


TTGGTCAGGC 


22620 


TGGTCTTGAA 


CTCCTGACCT 


CAGGTGATCC 


ACCCACCTCG 


GCCTCCCAAA 


GTGTTGGGAT 


22680 


TTCAGGCATA 


AGCCACCAGG 


CCCAGCCTTT 


CTTTCTTTTT 


AAAATTAATC 


TTTGTTTAAA 


22740 


AATACTCTCA 


TTTTTTATTT 


AATTGTAGCA 


CTCCTAGATC 


CCGAAAGCAG 


ATACACTCTT 


22800 


GTTATGGGTC 


TGATTCTTTT 


CATTGCTTCA 


CGCCTTAGAG 


GATATTGTCC 


AATACTGGAT 


22860 


AAAAGTTTAC 


TCAGGTCTAC 


TTCCACTTTA 


ACGGGGATGG 


CTGAATATCT 


CTTCCACTTG 


22920 


GCTGTTTGTT 


TATAATGAAC 


TGACAAACAT 


ACAAATTTTC 


TTGAGTTCTG 


TGAGACATTC 


22980 


TAGTAAATCA 


TCTAACCTGA 


AGAGCAGGTT 


GTGAGAACCC 


CTGATTTAGA 


AAGCCCAGTG 


23040 
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GTCATAAATA 


TAAGTGGCTC 


TGGACTGGCT 


CCCGGGGTCT 


GAAGTGTGGG 


CAGTCGGTTA 


23100 




GGATTGAGCC 


CTTGTAATTT 


GTAGGATCTG 


ACACACACTC 


CAGGAAGGCA 


GTGTCAGAAT 


23160 




TTACCTGTAT 


TATATTGGAC 


ACCCAGTTAG 


CGTTTGGAGA 


ATTGGTTGCT 


GGTATAGAAA 


23220 




AATACCAAAT 


ATTTTATGTC 


AGGGGAGTGA 


AAGAAAAAAC 


AAAAACCCGG 


CCGGGCGCGG 


23280 


5 


TGGCTCACGC 


CTGTCATCCC 


AGCACTTTGG 


GAGGCCGAGA 


CGGGCGGATC 


ACGAGGTCAG 


23340 




GAGATCGAGA 


CCATCCTGGC 


TAACACGGTG 


AAACCCCATC 


TCTACTAAAA 


ATACAAAAAT 


23400 




TAGCCGGGCG 


TGGTGGCGCG 


CGCCTGTAGT 


CCCAGCTACT 


CGGGAGGCTG 


AGGCAGGAGA 


23460 




ATGGCGTGAA 


CCCGGGAGGC 


GGAGCTTGCA 


GTGAGCCCAG 


ATCGCGCCAC 


CGCACTCCAG 


23520 




CCTGGGCGAC 


AGAGCGAGAC 


TCCGTCTCAA 


AAAAAAAAAA 


CAAAAAAAAA 


AAACAAAAAA 


23580 


10 


AAAAAACCCA 


TACACTTTAA 


GGAAAGCAAC 


TGACAGCATT 


TGTTACCAGT 


GATAAAATTT , 


23640 




GAGCTTTGAA 


GTAAGAATAA 


CAATTTTGCC 


ATTGTGCCCG 


GGCCAAGAAA 


AAAAAAAGAA 


23700 




TTTTGCCATT 


GTGAAAGGCT 


TCCCAGTACT 


TTCTGATGAG 


CTTGACGGTG 


ATATTAACAA 


23760 




ATAACTTTTT 




TTGAGATGGG 


GTCTTGCTCT 


GTCACCCAGG 


CTGGAGTGCA 


23820 




GTGGTTCAAT 


CTCAGCTCAC 


TGCAACCTCC 


GCCTCCCAGG 


TTCAAGCGAT 


TCTCCTGCCT 


23880 


15 


CAACGTCCCA 


AGTCGCTGGA 


CTACAGGTGT 


GCGCCACCAC 


GTCCAGATAA 


TTTTTGTATT 


23940 




TTTAGTAGAG 


ATGGGGTTTC 


ACCATGTTGC 


CCAGACTGGT 


CTCAAACTCG 


TGACCTCAGG 


24000 




CGACCCGCCC 


ACCTCGGCCT 


CCCAAAGGTG 


GGAGGCCTTG 


CTGGGATTAG 


AGGTATGAGC 


24060 




CGCTGCACCT 


GGCCTCTTGT 


CCTTGTGTTT 


TGCAGTGATG 


CAATGACCAT 


GTCTTACATT 


24120 




TGCAACCAGA 


AAAAAAGGTT 


AGTGTAACAA 


TGTTTATCCT 


GTTTTTCCCA 


GAGTAGACAT 


24180 


20 


TATGAAGATT 


AAAAAAATTT 


GAAAGTGTTT 


TGAATATAAT 


AAACTATGCT 


ATACACACAA 


24240 




CATTTTGGTG 


ACTAGAAATA 


CAAGTTTATT 


GTTTGTTGTT 


TGTTGAGACA 


GGGCCCTGCT 


24300 




CTGTCTCCCA 


GGCTGGGTGG 


CACAATCATG 


GCTCACTACA 


GTCTTGAACT 


CCTGGGCTTA 


24360 




AGCGATCCTC 


CCACCTCAGC 


CTCCAGAGTA 


GCTGGGACTG 


CAAACGAGCA 


CCACCACGCC 


24420 




TGGCTAATAT 


TTGTATTTTT 


TGTAGAGATG 


GGGTTTCACC 


ATGTTGCCCA 


GACTGGTCTC 


24480 


25 


AAACTCCTGG 


GCTCAAGCAA 


TGCTCCTGCC 


TCGGCCTCCC 


AAAGTGCTGG 


GATCACAAGT 


24540 




ATGAGCCACT 


GCACCCGGCT 


GAGTTTCTGT 


TGTTTTAAGC 


CGCTTCATTT 


GTGGTACTTC 


24600 




TTACAGCAGT 


CCCAGGAAAC 


TGAGCAACTG 


CAGAACATCA 


AAATTGTTTT 


TCTTCAGCAA 


24660 




AAGGAGAAGC 


ACTTGTGGTT 


GGCACCAGCT 


TTTCCTGTGC 


TCACTTCTGC 


ATGGCCGCAC 


24720 




CTTTGCCCGA 


CACGAGTGCA 


CAGCAGGCTG 


TGGGGGAGCA 


ACTGGTTGAG 


TCAGGCCTCC 


24780 


30 


ACTTGTGCCG 


TATCCCCACC 


TGCTTTGCTG 


GACACCCCTG 


TTTGGGGGGC 


ACCCACTGCT 


24840 




GCCCCAGACA 


CCAAGCAAGC 


ACCAGCTGTG 


TCCAAAACTT 


ACAGTCACTG 


TCTTGGCCCG 


24900 




TTTTGTGCTG 


CTGTAACAGA 


ATGCCACAGA 


CTGGGTAATT 


TAATACAGAA 


CAGAAATTTA 


24960 




TTTCCTCAAA 


GTTTTGGAGG 


CTGGGAAGTC 


CAAGAGCAAG 


GGGCCATCAG 


GTCAGGGCCT 


25020 




GGTCTCTGCT 


TCCACGATGG 


CACCTTGACC 


ACCGTGTCCT 


CACGTGGTCA 


GAGAGAGCCC 


25080 
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ACTCCCAGGA 


GCCCTTTTAA 


TAGAGCAGAA 


CACTGCTGCG 


CTGCGGTTAA 


GTTTCCAACA 


25140 




CGTGAACTTC 


GGAGGTGACA 


CATTCAGATC 


ATAGCAGTCA 


CTCTAGGCAG 


AGTGTCTGAT 


25200 




GTGGTTTTAA 


AATACGTTCA 


CAGACTGGCC 


GGGCACTGTA 


GCTCACGTCT 


GTAATCCCAA 


25260 




CAGTTTGGGA 


GGCCAAGGTG 


GGTGGATCAC 


CTGAGGTCAG 


GAGTTCAAGA 


CCAGCCTCAC 


25320 


5 


CAACATGGTG 


AAACCCCATC 


TCTACTAAAA 


ATACAAAATT 


AGCCAGGTGG 


TGCATGCCTG 


25380 




TAATCCCAGC 


TACTCGGGAG 


GCCGAGGCTG 


GAGAATCGCT 


TGAATCCAGG 


AGGTGGAGGT 


25440 




TACAGTGAGT 


CGAGATCATG 


CCATTGCACT 


CCAGCCTGGG 


CAACAAGAGC 


GAAACTCTGT 


25500 




CTCAAAAAAT 


AAAATAAAAT 


AAAATACATT 


CACAAGGCCG 


GGCACTGTGG 


CTCACGCCTG 


25560 




TAATCCCAGC 


TACTTGGGAG 


ACTGAGGCAG 


GAGAATCGCT 


TATAACCTGG 


GAGGTGGAGG 


25620 


10 


TTGCAGTGAG 


CTGAGATCAC 


ACCGCTACAC 


TCTAGCTTGG 


GCAACAAGAG 


TGAAACTCCG 


25680 




TCTCAAAAAA 


GTAAAATAAG 


GCCCTGCAGG 


CATGGTGGCC 


CACACCTGTA 


ATCCCAGCAC 


25740 




TTTAGGAGGC 


CAAGGCGGTC 


GGATCACGAG 


GTCAGGAGTT 


CGAGACCAGC 


CTGGCCAACA 


25800 




TGATGAAACC 


CCGTCTCTAC 


TAGCCTAGCC 


AACATGGGGA 


AACCCTGTCT 


CTACTAAAAA 


25860 




TACAAAAATT 


AGCCGGGCAT 


GGTGGTGCGT 


GCCTGTAATC 


CCAGCTACTC 


AGGAGGCTGA 


25920 


15 


GGCAGGAGAA 


TCGCTTGAAC 


CCAGGAAGCA 


GAGGGTGCAG 


TGAGCCAAGA 


TTGCGCCGCT 


25980 




GCTCTCTAGC 


CTGGGCGACA 


GAGCGAGACT 


CCATCTCTAA 


ATAAATAAAT 


AAAATAAGAA 


26040 




AATAAAATAT 


GTTCACAAAT 


CCTTTGACAT 


TCCTCACCTC 


AAAAGCTGGA 


ACCCAACTCC 


26100 




CTCCTAAGCA 


TGAGTCTTCT 


CAGTGACTCA 


CTTCTAACAG 


CAGAACTTAC 


ATGGTTCCCC 


26160 




ACACCCAGAG 


GACATTGGGT 


TCCTCCCAAT 


ATCCCCCCAC 


CCAGCGACCC 


CCACCCAGGT 


26220 


20 


CGCTGGCTTT 


GGGTCCCCCA 


GAGCCATGTT 


TCAAGGACAC 


TCAGGCAGCC 


CCTGGATGTC 


26280 




CATGTGGTAA 


GGAATGAAGG 


CCTCCTGCCT 


GCAGCCTCGG 


GAGGGAGCAT 


TCTCAGAAGA 


26340 




GGATGCCCCA 


CCTCCTGCCC 


AGCCTTCAGA 


TGGCCAGGAC 


CTCGTCCAAC 


GTCCTGACTG 


26400 




CAACATCATG 


AGAGACTCCG 


AGCCAGAAAC 


CCCCAGGTTT 


TGTACTCCTG 


ACTTATGGGA 


26460 




ACTGACAGAT 


AATGTTCGTT 


GTTAATTAAG 


GGGTGACTTG 


TCACACACAA 


TAGGTCACTA 


26520 


25 


AACAGCTCTG 


TCTGGCCTCC 


CAGGAGGAGC 


CTGCCTTTCC 


TTTTCTTCAT 


GGGAAAAGTG 


26580 




CGATCAGTTT 


GTGAAGGAAT 


GTCCGCCCCC 


ACTTGATGCC 


AGAGGCTCCA 


CATGGTGACT 


26640 




GTCATAAACT 


CCATCTGCCC 


TCAGTGCCTT 


GCCAGCACCC 


GGCCTGCGAT 


CAGCTTGGTC 


26700 




TTGCGGGAGG 


CCAAGGCCCA 


CGTGTGTTTG 


TGTGTGGTGT 


CTGTGTCTGC 


GTGCCCATGC 


26760 




ATGCCCAGGG 


TACAGGGATG 


CCATATACAA 


ATTCTTTCAA 


TGTTGTATGT 


GGCATGTGTG 


26820 


30 


TGTCTGTATG 


CCCAGGATAC 


AGGGATGCTA 


TATACAAACT 


CTGTTTTTTC 


GTTTTTTTTT 


26880 




TTTTGAGACA 


GAGTCTTGCT 


GTTTCGCCCA 


GGCCGGACTG 


CAGTGGCGCT 


ATCTCGGCTC 


26940 




ACTGCAAGCT 


CCACCTCCCG 


GGTTCACGCC 


ATCCTCCTGC 


CTCAGCCTCC 


TGAGTAGCTG 


27000 




GAACTACAGG 


CGCCCGCCAC 


CACACCCGGC 


TAATTTTTTG 


TATTTTTAGT 


AGAGACGGGG 


27060 




TTTCACCATG 


TTAGCCAGGA 


TGGTCTTGAT 


CTCCTGACCT 


CGTGATCCAC 


CCGCCTCAGC 


27120 
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CTCCCAAAGT 


GCTGGGATTA 


CAGGCATGAG 


CCACCACGCC 


TGGCCTACAA 


ACTCTTTCTT 


27180 




TTTTTTTTTT 


TTTTTTTTGA 


GATGGAGTCT 


CACTGTCTTC 


CAGGCTGGAG 


TGCAGTGATG 


27240 




CGATCTCAGC 


TCACTGCAAG 


CTCCACCTCC 


CGGGTTCATG 


CCATTCTCCT 


GCCTCAGCCT 


27300 




CCCAAGTAGC 


TGGGACTACA 


GGCACACACC 


ACCACGCCCA 


GCTAATTTTT 


TGTGTTTTTA 


27360 


5 


GCAGAGATGG 


GGTTTCACCA 


TGTTAGCCAG 


GATGGTCTCG 


ATCTCCTGAC 


CTCGTGATCC 


27420 




GCCCGCCTCG 


GCCTCCCAAA 


GTGCTGGGAT 


TACAGGCGTG 


AGCCACTGCG 


CCCAGCCTGC 


27480 




AAACTCTTTC 


AATGTCTTTC 


TTTTCTCTCT 


CCTGCCATCT 


TCTCCCTTGC 


AGATTTCTTT 


27540 




TGTCTCTACG 


TCTTCCCCAG 


CTGAGTCCGA 


GGTCCTGACT 


TGCCCACGCT 


CCCTGGACTG 


27600 




GAGGAGAGGT 


GATAGCAAGA 


GCTCCTTCAA 


GCCCAGGAAT 


GCCACCAGGG 


CTGCCCCGGG 


27660 


10 


AGAGGAGGAA 


GCTGGGTCTC 


TCGGGGTTGT 


GGGGACCAGA 


CACCCTTCTA 


AGACATGGAC 


27720 




TCAGCACAGA 


AAGTCTAGAC 


ATCCACTACA 


AACACATCTC 


CCTCCTAACA 


GGGGGCCCCT 


27780 




GGGCACCCCA 


AGTGGCTGTT 


TGGTGGGACA 


GGCATGTCCA 


TCAGTCAGAA 


TATCTTTATT 


27840 




TTTTATTTTT 


TATTTTTTAT 


TTTTGAGAGA 


GTTTCACTGG 


AGTGCAATGG 


CACGATCTCA 


27900 




GCTCCCTACA 


ACCTCCGCCT 


CCCAGGTTCA 


AGCGATTCTC 


CTGCCTCAGC 


CTGCCACGTA 


27960 


15 


GCTGGGATTA 


CAGGTGTGAG 


CCACCACACC 


CAGCTAATTT 




TTTTTGAGAT 


28020 




GGAGTCTCGA 


GGCTCTGTCG 


CCCAGGCTGG 


AGTGCAGAGG 


CGCGATCTCA 


GCTCACTGAA 


28080 




AGCTCCGCCT 


CCTGGGTTCA 


CGCCATTCTC 


CTGCCTCAGC 


CTCCCGAGTA 


GCTGGGATTA 


28140 




CAGGCATGAG 


CCACCGCGCC 


CGGCCAATTT 


TGTATTTTTA 


GTAGAGACAG 


GGTTTCACCA 


28200 




TGTTGGTCAG 


GCTGGTCTTG 


AACTCCTGAC 


CTCAGGTGAT 


CCACCTCCCT 


CGGCCTCCCA 


28260 


20 


AAGTGCTGGG 


ATTACAGGCC 


TGAGCCACCA 


CGCCCAGCCC 


AGAATGTCTT 


CTTACTTTTT 


28320 




ATTACTCTGT 


CCCCCATCCT 


GGGTCCAGAC 


CTGTGACCGT 


GAACAACCGG 


CTGCCCAGGG 


28380 




GTGAATGGGG 


TGAGTGGGGT 


GAGTCCACAG 


AACAGTGGGG 


TGCAGCCCCA 


GGGGTCTCGT 


28440 




AGCACCTGCC 


CCCAGGTCAG 


GAAGTCCCAC 


AGCCTAGAGG 


CTCCAGCCTC 


AGATGCATAC 


28500 




ATATGTAGGC 


CCTGCCCTTT 


CCTCCTGAGC 


GGCGGGCCAC 


AGAGTCCTGA 


ACAACAGGAA 


28560 


25 


GCCCCTGAGG 


AGGGCTCCGC 


CCTGAGGGAG 


GGCAGGGGAG 


CCCCCGCCAG 


CCCCACCCAC 


28620 




AGCAGCGGGC 


CCTGCCACCC 


CCCACCCTGA 


CACCTCACCC 


CTTGGATTCC 


AGAGAGGAAA 


28680 




GTGGGCTTGT 


GTGTAGTTTA 


CATGCTCATA 


TCTTAAAATC 


ACCGTTGTCA 


ATAGAACAAT 


28740 




TCATAATAAT 


GATGATAAAA 


TAAGATTTAT 


AACCAGCTTC 


AGTCTGGAGA 


TACACACAGA 


28800 




GCAGATCTTC 


ACTCCCAGAC 


AGGGAGCCCG 


CAGCTGCCCC 


CGACCCCACA 


GGTGCAGGAC 


28860 


30 


ACACACAGAC 


AGTTCAACCA 


TGTCTTAAAC 


ACACAGGTGT 


TTATTTAATT 


GTTCATTTGA 


28920 




TTGAATTTTT 


AAGTTCACTT 


TACTACGTGG 


ATGAGATGGG 


TGCATATTAC 


AGTAGGCTTT 


28980 




CGCTATGAGC 


GCTGCCACCA 


TGAGGAATAT 


CCCAGCCCTC 


AGTTCTGCTT 


CCCTTTCTGA 


29040 




GTCCCACAAA 


AGCCAGATGT 


GGACAGCCTT 


GGGTTCCCAT 


CCCAGCTGGC 


TGCTCCTTCT 


29100 




GGGGCTGTCT 


TGGTGGGGAG 


AGGGAGATGG 


GGCAGTGGGT 


CCCTGCTGAC 


CCCTGAGCCC 


29160 
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TGCAGGGGTC 


AGGATCCTCC 


CGTGGTCCCT 


GGGTGTGGCT 


CTGGAAGACA 


CTGGCAGTGC 


29220 




CCGGCCAAGG 


CCTCCCGCAG 


GATGGAAGTT 


GAGGGCCCTG 


GCTCTGGGTC 


CTAAGAGAAC 


29280 




TCAGCCGCCC 


CCTTCACACT 


TTACAGCAAG 


GGGCCAGGCA 


GCAGCTTTGG 


GATGGGGCTT 


29340 




CCGTGGAGAA 


GTGGGGGATG 


CTGCAGTGGT 


ACAAAGACAG 


CCTCCCCCAC 


CGCCATCCTC 


29400 


5 


CAGCTGACCG 


TCCTCCAAGG 


CCAGCACTGG 


GCGTCCAAGG 


GAAAGAAGGA 


ACTCAGCCCA 


29460 




GAGGGTGTGG 


GCAGGAGAGG 


CCTGGAGTCA 


GGCCTCCACC 


CACAGCCCCC 


TCTGGGTGCC 


29520 




AAGTGGGAAG 


GGTGTTGGGG 


CTGGCTTGGG 


AACCTTACCC 


GCTGCCCTTC 


CAACACCTGG 


29580 




ATCTGTGGGC 


AGCGGTCCCA 


CAAAATCCCC 


CTTGGGGCTC 


CCTGAGGAGG 


ACTTGTGGCT 


29640 




GCCGCTTCCA 


CCAGGGCAGA 


GGGCACAGGA 


GGGGCCAGCA 


CTCCAAAGGG 


CTCTAGGGTG 


29700 


10 


GGTCTTTCAA 


GGACATCTGC 


AAAGCCCTGG 


TGGGGAGGGG 


CCTGGGCCAG 


AGGCTCTTTG 


29760 




GAACTCTTGC 


ACTTCTGAGT 


GGGGGACTGT 


CCATGCTGCC 


CACAACCTCT 


AGACCATGCA 


29820 




GCCTGCTCAT 


GGGTCCCTGG 


CAGAGAATGC 


CCACTCCCCA 


GCAGACTCAG 


GGCAGGCCCC 


29880 




CAACTGCAGG 


CTTCCAGGAA 


GGCCCAGGGT 


GTCCACCTCA 


CGCCAGGTGG 


TCTCAGAGGA 


29940 




CCCCTGTGCA 


ACCACATTAA 


GGAAAGCTGC 


AGCCCCCACC 


CACCCGCCTG 


CCAGTTCAAC 


30000 


15 


AAGCACCGGC 


TGCACACGCA 


GGCTCCCAGG 


CACCATCACC 


CCCCTCCCCC 


GTCGCCCCTC 


30060 




CCTCACGGGG 


AGCCCCTTCC 


CCCTGGAAAG 


ACAGCAGGTA 


CTGTAGCCTC 


GCCTGCTGGC 


30120 




CAGGGGCGCC 


GGCTCAGAGG 


ACCTGCCCTG 


ACCTGCACGT 


GCTGACCAGA 


CAGCCCAGCG 


30180 




TAAGGACCCG 


CGATCCCACG 


CCACCGCCCT 


GGGTTTACCA 


CGGTCACCAC 


CACCTCTCTC 


30240 




ACAGGGCCCC 


CGGGGGACCC 


AGCCGCGCCC 


GGCCTGGTGT 


CTGCACCGAG 


GGACCGCGTC 


30300 


20 


TCACGCCCGG 


CGGCTCCTGC 


AGGGGAAGCC 


GTGGTCAGCG 


ACTCACCACG 


AGGACAGGGC 


30360 




AGGGCGGCTG 


AGTGCGGAAG 


AGAAGCATGA 


AGCTGGGGGC 


GGGGGTGGGG 


GAGGAGGAAC 


30420 




AAAAGTTGCA 


TCTAGACAGA 


GGTGAACGAA 


ACAAAACCAA 


AACCCGAACG 


TGTTCCGTCG 


30480 




CAGGATGGGC 


GCCGCCCGTC 


CCGGGCCCTT 


AGCCCGACAT 


CTCTTCTCGC 


TGCTCCTTGT 


30540 




TCCTGCGCAC 


CTCGGCCGCG 


TGCAGCTCCT 


GCAGGACAGG 


GGGCGGGAGG 


GCCTGAGGGC 


30600 


25 


GGGGGTGGCT 


TGGGGCGACT 


CCGGGAACCC 


CCAGGCGCGC 


AGGCCGTGGC 


GCCCTGGCAC 


30660 




CCGCCCGGCC 


TCATCCGGGC 


TGGCCTTCGG 


CAGGACCCTG 


ACTGAGTTGA 


GGGGGCGGGA 


30720 




GCACCGGGGA 


GGCGCAGAGC 


AAGGCCAGGG 


ACCAAGGACG 


GGTTTCCTGG 


GAGCTGGCTG 


30780 




GGCCCCGCTT 


CTAGCTCGTA 


CCGGAGCCGA 


GCTTCCTTCA 


GGGCACTTTC 


AATATAATGA 


30840 




ATTTAGCCAT 


CTATTACTGC 


GGCTAGTTAC 


TGTCCCGCCA 


GGACCAGACT 


CTGGACCTGC 


30900 


30 


CTCGTGCGCT 


GCTGGGGACG 


CCCAGTAAAC 


ACGGGAGGAG 


CCCCCGACCC 


CCACCCCAGC 


30960 




TCAGCGCCTC 


GGAGTCCCCG 


GCCCCGCTCT 


GCGCCCCTCC 


GAGCTCCGCC 


CTAGCCCCGC 


31020 




CCCCGCCCAG 


TGCCCCGCCC 


CCTGCCTGCT 


GCTAGCCCTG 


CCCCCGCCCC 


GGCCCCTGCC 


31080 




CGCTCCGAGC 


TCCGCCCTGG 


CCCCGCCCCG 


GCCCCTGCCC 


GCTCCGAGCT 


CCGCCCTGGC 


31140 




CCCGCCCCCC 


GCCCAGTGCC 


CCGCCCCCTG 


CCTGCTGCTA 


GCCCTGCCCC 


CGCCCCGGCC 


31200 
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CCTGCCCGCT 


CCGAGCTCCG 


CCCCGGCCCC 


GCCCCGGCCC 


CTGCCCGCTC 


CGAGCTCCGC 


31260 


CCTGGCCCCG 


CCCCCGCCCA 


GTGCCCCGCC 


CCCTGACTGC 


TGCTAGCCCT 


GCCCCCGCCC 


31320 


CGGCCCCTGC 


CCGCTCCGAG 


CTCCGCCCCG 


GCCCCGCCCC 


GGCCCCTGCC 


CGCTCCGAGC 


31380 


TCCGCCCCGG 


CCCCGCCCCG 


GCCCCTGCCC 


GCTCCGAGCT 


TCGCCCCGGC 


CCCGCCCCGG 


31440 


CCCCTGCCCG 


CTCCGAGCTC 


CGCCCCGGCC 


CCGCCCCCGC 


ACCTTCTCGC 


GCAGCCGCTC 


31500 


GCGCAGTGCG 


GCCAGGTGTG 


CCTCGCGGAT 


CTCCTTGCTG 


AGCTCCATCT 


TGTAGTTGAG 


31560 


CTTCTCCTCC 


GCCTGGCGGC 


TGAAGTTGTT 


ATTCTCCTCC 


AGCGCCTTGT 


GCAGCACCTC 


31620 


GCGCTCGTGC 


TCGCGCCGCT 


CCGCCAGCTG 


CTTCAGCACC 


TGCGCCTCCT 


GCGTCTGTGC 


31680 


GGGGCCGGCG 


GGCGCGCGTG 


AGCGGCAACC 


CCGGGCCCTG 


CCCGGCCGGA 


CTCCTCCCTG 


31740 


CTCTCCGCCT 


CCCGCCCAGC 


GCCCGCTCGC 


CTCACCTGGC 


GCCTCCACCT 


GCCCAGGCCT 


31800 


CGGTGGGCGC 


CGGGACCCCC 


GGGCGCTGCC 


CTGGGAACCC 


TCGCCTGCCA 


TCCGGCCTGT 


31860 


GGTCGGGGCA 


GGGCCAGGGG 


GTCGCGATCC 


GCCGCCCCCG 


CCCCCGTCCC 


TGCCTCGCGC 


31920 


GCGGGTCCCG 


CGGTCCTGGC 


TGCGCCCAGG 


GCCCCCGCCA 


TACCCTGCCG 


CCACTGCACA 


31980 


CCCTGCCCTG 


CGCGTCTGCC 


CCTCCAAGGA 


CCAGCAGCAA 


GAAACCCTAA 


ACTTGTGGGC 


32040 


GGTCTCTGAG 


CTTTGTCTCT 


TCCTCGGACA 


TCCGCCCACT 


GAGCAGAGTA 


GCTGCTTGTT 


32100 


ACACACCGGG 


TTCCCAGCTC 


CCAATTAGGT 


GCCCAGGAGC 


GGAGGGTCCC 


CAGGGATGCT 


32160 


GGGGGAGGGG 


CCGGCTGGTG 


ACCCCTGGGA 


GGAGAGCGGG 


GCAGCAGGAC 


CCGCACCCAC 


32220 


ATGCCAGTCC 


CTACTAGTCA 


GCCCTGTGAA 


CCCTGGTCTC 


TGGCCTCACC 


GGGAAGGGAA 


32280 


CGGAGCCGCT 


TCCCCTGCCC 


AATGCGTTGG 


CCTCCAGGGT 


GGCACCCCCA 


AAAGGACATT 


32340 


TTTATCTCTG 


TTTCAGTCTC 


AGAGGGGCTG 


GTGGGAGGGG 


AGGCTGCAGG 


GAGGGGACCT 


32400 


GGAGCCCACA 


CCCACCTCTC 


CCAGGGCCCC 


TCCGCCCTCC 


AGCAAGCCTC 


AGGGTCTTCA 


32460 


CACATGAGGC 


CCTTCCTCCA 


GCTTCCCTGT 


CTGGGAGAGG 


GATGCCCCAC 


CCGACGTCCC 


32520 


CAGGGCCCAT 


CTGGGGACCA 


CCCCCTAGCA 


TCCTGCTGGC 


CCTGACAAGG 


GTGCCTCCCA 


32580 


CCCTCACCAG 


AGGCTCCTGC 


TCCTTCCAGG 


TGGCCGCCTC 


GGAACCCTTC 


CTCCTCTCCA 


32640 


TCCCTTTCTT 


TTTTTGTTCT 


TGTTTGTTTT 


TTGAAATGGA 


GTCTCACCCT 


GTCGCCCGGG 


32700 


CTGAGGAGTG 


CAGTGGCGCA 


GTCTCGGCTC 


ACTGCATCCT 


CCACTTCTTG 


GGTTCAAGCA 


32760 


ATTCCCCTGC 


CTCAGACTCC 


CTAGTAGGTG 


GGATTACAGG 


TGTGCACCAC 


CACACCTGGC 


32820 


TAATTTTGTA 


TTTTTAGTAC 


AGATGGGGTT 


TCACCATGTT 


GGCCAGGCTG 


ATCTTGAACT 


32880 


TCCAACCTCA 


AGTGATCTGC 


CTGCCTCAGC 


TTCCCAAAGT 


TCTGGGATTA 


CAGGCGTGAG 


32940 


CCACCACACC 


CGGCCTCTCC 


CCATCCCATT 


CTTATCTCTC 


AGAAAGAGGC 


CCAGGGAGCC 


33000 


ACAGCCCCTC 


CTGCTCCAGG 


CCAAGGCACT 


GACCAAGCCT 


GTCCGGGAGC 


ACCCTGCTTC 


33060 


TTGCAGGCCC 


TGTCCCCGTG 


GGCCGCCTCC 


GTTGAAACTC 


CTGGGGGGTG 


GGGGATGGAG 


33120 


GACTCCTTGC 


CTTCCTCCGC 


TCCTCGGCTG 


CCTCCAGCCG 


CTTTTGCAGC 


TCCTCCAGGG 


33180 


AGGTGTCCTT 


CTTCTTGGGT 


GGGGAGGAGA 


GCATAGGGCT 


CTCTGGGGAC 


AGGTCAGAAG 


33240 
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GGGACTTGAG 


GATGACCTCG 


AAGCTCTGGC 


CTGAGGCCCG 


CTTGTCCAGC 


TGCTTCACCT 


33300 




CCATGTCTGC 


AGGGCAAGAC 


CAGAGTAGAG 


CTTCAGAGGC 


CCGGCCAGGG 


CATGGCGTGG 


33360 




GCTGAGCGGG 


ATGCTCCCAG 


CACACATCCA 


ACCCCAGGGC 


TGGGCGAGAG 


GGGGTGGCTG 


33420 




CTCCCGCAGG 


AATCCCAGGC 


TTCAGCCCCC 


AGGATGGGCC 


CCTTCCCCCT 


AGAACCTCCC 


33480 


5 


TCTCCAGAGG 


CAGCCAGGAC 


GGGAGTTCAG 


AGAGACTGCC 


GGAGGCCGGG 


GGAAAAGGTG 


33540 




AGGTGGGCAG 


GCACCGCAGG 


GAAGGGCAGG 


CGGCAGCCAG 


GCACTCACCC 


CCGTACTGGT 


33600 




AGACGGTATT 


GGGGTGCGGC 


TGTGTGTAGA 


AGCAGGAGCA 


GATGAGCGAC 


AGCACCGACA 


33660 




GCTCCTTCAT 


CTTCTCCTTG 


TAGGCTGTGG 


GCACAAGGCT 


GGGCTGAGCA 


AGCACCACTG 


33720 




GGGCCTGCCC 


ACCTGGGCCC 


CCGTTTTCCC 


TCCCCATGGC 


TGCCTCTATC 


ATGTCTCTGT 


33780 


10 


GAGACACGGA 


GCTGCCCAGC 


ACGCTCTCTT 


GTGTGTCTCC 


ACACCGCCGG 


CCCCTTCGTC 


33840 




TCTCCAGCTC 


TCTCGCTTCC 


AGACGTCGGC 


ACTGTCTCCG 


TGGTGTGTCC 


CCTGCCTTCT 


33900 




GTCTCTCTCG 


CCCTCTGCCT 


CTCCCCGCTT 


TTCCTCTCTC 


TCGGCATTAA 


TGTCTGTCTC 


33960 




ATCTTCCACA 


CTGACTTGTT 


TCTCCATCCT 


TCTCCTGCCT 


GCTGTGGTCT 


GAATGTTTCC 


34020 




ATTACCCAAA 


ACTCATGTGT 


TGAAATCGTA 


ACCCCAAGGT 


GCCGGTGTGC 


GGAGGTGAGG 


34080 


15 


CATTCGGAGG 


GAATTAGGCC 


ATGAGGATAG 


AGCCCTCCTA 


AGTGGCCCCA 


GAGTGGGGCT 


34140 




TCAGAGAACT 


CCCTCACCTT 


CCATCATGTG 


AGGACACAGC 


CAGAAGACGC 


CACCCGTCTA 


34200 




TGTACCAGGA 


GGCGAGACCT 


CTCCAGGCAC 


CGACTCTGCC 


GGCACCTTGA 


TCCTGGACTT 


34260 




TCTGGCCTCC 


AGAGCGATGG 


GAAATAAGTT 


CCTGTCGTCT 


ATAAACCACT 


CAGTCTCAGG 


34320 




TACCTGCCCA 


GACTGACAAA 


GTGGCTACCC 


CTGCCTGTCT 


GGGTCTCTGT 


TTACCTTCTG 


34380 


20 


TGTGTCTGAC 


TCTGTCACTG 


TCATTGTATC 


TTTCTGTGTC 


TCTGGGGGTA 


GCCCCTGACT 


34440 




CTGTCTTTCT 


CCCTGAGTGC 


ATCTTTCTGT 


GATTCCTTGT 


CACTGTGTGT 


CTTTCTGACT 


34500 




CTTACCTCCC 


TCTGTCCCGC 


TACTTCTCTC 


TCCCCTCCTC 


CTCCTTCCCA 


CTCCTCGCCA 


34560 




GCTCAAGCAG 


GCAAGATTTA 


CTCATGACGG 


GACCAGCACA 


GATGCAAACC 


CTCTGTGGGC 


34620 




AGGACTTTCT 


TGGGCTGTAA 


ACCTGGATGA 


AGCCCTCAGA 


CCCTCCTTTT 


TCCTTCCCAA 


34680 


25 


TGATTGTGTG 


GTCACCTTGA 


GATGAAACCA 


GGCCCTCTCC 


AGGCACATGC 


TCTCTGTCTA 


34740 




TCTAGGGCTG 


GGCTTGGGCC 


ACTGATGCCA 


CCAAGGAGCA 


AGGGAGGGAA 


GCTGTCCGTT 


34800 




CAGCACCACA 


GCCAGCCCTC 


TTGCCCATTC 


AGGTCAATCA 


AGTGCCCACC 


AGCCAGTGTC 


34860 




CCTGCTGCCC 


AACCCAAACC 


AGAAGCAAGC 


CGGGCTCCTG 


TGGCCCTGTG 


CCCTGTCAGG 


34920 




GGAAGAGGAA 


GGCGCCTGCT 


GTCACAGTGA 


AAATAATTTA 


GCTCTTTTGG 


TCTATTCAGG 


34980 


30 


GCGAACCTCA 


TTCCTAAGCA 


GACACGCTGG 


CCCGGTTTCT 


CACTAGTGCT 


CGATAATCCT 


35040 




TTTGGCTGGG 


TGCAGTGGCT 


CATTTAACTG 


TAATCCCAGC 


ACTTTGGGAG 


GCCAAGGCAG 


35100 




GTGGAACACC 


TGAGGTCAGG 


AGTTTGAGAC 


CAGCCTGACC 


AACATGGTGA 


AACCCGATCT 


35160 




CTACTAAAAA 


TATAAAAATT 


AGCCAGGCGT 


GGTGGCAGGC 


ACCTGTAATC 


CTAGCTACTT 


35220 




GGGAGGCTGA 


GGCAGGAGAA 


TCGCTTGAAC 


CTGGGAGGCG 


GAGGTTGCAG 


TGAGCCGAGG 


35280 
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TCGCGCCATC 


GCACTCCAGC 


CTGGGTGACA 


GTGTGAGACT 


CCGTCTCAAA 


ACAGAAAGAA 


35340 




AAAGAGAGAG 


AGGAAGAAAG 


GAAGGAGGGA 


GGGAGGGAGG 


AAAAGAAGAA 


AGGAAAGGAA 


35400 




AGGAAGACAG 


ACAAGGCAGA 


AGTAATCAAG 


CCTTTCATGG 


TGAGCTGGGT 


CTTCTGGTGA 


35460 




CAGTGCAGAG 


AATGGTCTGT 


CCTGACTTAA 


ATTTCCTGGT 


GACCTACACT 


TTTCTGGACA 


35520 


5 


GAGCAGCACA 


GAGCCCAAGA 


GGGTGTAAGG 


AGGAGCAGAA 


AGGAATCCCA 


GGGTGGGCAG 


35580 




GCCCGTGCGA 


GAGCCTTTGG 


GGGAAGGAAT 


GAGACTTTGA 


GCCGGGAAGC 


GAGGCAAAGC 


35640 




TACCTGTCTT 


GGTCATTGTC 


TTCAGGGAGG 


GAGATGGAGG 


GGGACCAGGT 


GGGGGAGCCT 


35700 




CACAGGGGAC 


TTTGGTCTGA 


CTTGTCAAGT 


TTTCTTTTTT 


TCTTTTTGAG 


ATGGAGTCTT 


35760 




GCACTGTTGC 


CCAGGCTGCA 


GTGCAGTGGT 


GCGATCTCGG 


CTCACCGCAA 


GCTCCGCCTC 


35820 


10 


CTGGGTTCAC 


ACCATTCTCC 


TGCCTCAGCC 


TCCCGAGTAG 


CTGGGACCAC 


AGGCACCGCC 


35880 




ACCACACCCA 


GCTAATTTTT 


TGTATTTTTA 


GTAGAGACGG 


GGTTTCACTA 


TATTAGCCAG 


35940 




GATAGTCTCG 


ATCTCCTGAC 


CTCGTGATCC 


GCCCGCCTCG 


ACCTCCCAAA 


GTGCTGGGAT 


36000 




TACAGGTGTG 


AGCCACTGTG 


CCTGGCCTAC 


TTTATTTTTT 


AGAAACAGGA 


CTGTGCTCTG 


36060 




TTGCCCATGC 


TGGAGTGTAG 


GGTGCAGCTG 


TGCGGTTCAC 


TGCAGCCTTG 


AACTTCTGGG 


36120 


15 


CTTGACGGAT 


CCTGCCATCT 


TAGCAGCTGG 


GACTACAGGT 


GCATGCCAGC 


ACACCAGTTT 


36180 




TCTTTTTTTT 


TTTATCTCTG 


CTCACTGCAA 


TTCCGCCTCC 


TGGGTTCTAG 


CGATTCTCCT 


36240 




GCCTCAGCCT 


CCCAAGTAGC 


AGGGATTACA 


CGCACATGCC 


ACCACACCCG 


GCTAATTTTT 


36300 




GTATTTTTAG 


TAGAGACAGG 


GTTTCACTAT 


GTTGGTCAGG 


CTGGTCTTGA 


GCCACCGCGC 


36360 




CCGCCCGGCC 


TACACACCAG 


CTTAAAAAAA 


AGAAAAAAAT 


AGCTGGGCGT 


GGTGGCTCAT 


36420 


20 


GCCTGTAATC 


CCAGCACTTT 


GGGAGGCTGA 


GGCAGGCAGA 


TCACCTGAGG 


TCAGGAGTTC 


36480 




AAGACCAACC 


TGGCCAACAT 


GGCGAAACCC 


TGTCTCTACT 


ACAAATATAA 


AAATCAGCCA 


36540 




GGCGTGGTGG 


CGGGCTCCTC 


TAATTCCAGC 


TACTTGGGAG 


GCTGAGGCAG 


GAGAATCACT 


36600 




TGAACCCGGG 


AGGTGGAGGT 


TGAAGTGAGC 


CAAGATCGAG 


CTACTGCACT 


CCAGCCTGGG 


36660 




AGCAAGACTC 


CCGTCTCAAA 


AAAAAAAAAA 


AAATTTGTAG 


TGGTATGGAG 


GCCGGGCATG 


36720 


25 


GTGGCTCACG 


CCTGTAATCC 


CAGAACTTTG 


AGGGGCCAAG 


GCGGGCAGAT 


CATGAGGTCA 


36780 




GGAGTTCGAG 


ACCAGCCTGA 


CCAACATGAT 


GAAACCCTGT 


CTCTACTAAA 


AATAACAAAA 


36840 




ATTAGCCAGG 


CATGGTGGCG 


GGCACGTGTA 


GTCCCAGCTA 


CTCGGGAGAC 


TGAGACGGGA 


36900 




GAATCGCTTG 


AACCCAGGAG 


GCAGAGGTTG 


CAGTGAGCTG 


AGATCACGCC 


ACTGCACTCC 


36960 




AGCCTGGGTG 


ACAGAGTGAG 


ACTCTGTCTC 


AAAAACAAAC 


ACAAACAAAC 


ATATATATAT 


37020 


30 


ATACATGTAT 


ATATATAATA 


TATATATACG 


TATATATACA 


CGTGTATATA 


TATAATATAT 


37080 




ATACGTATAT 


ATACACGTGT 


ATATATAATA 


TATATACGTA 


TATATGTATA 


TATTAATATA 


37140 




TATACGTATA 


TATACACGTG 


TATATATTAA 


TATATATACG 


TATATATACA 


CGTGTGTATA 


37200 




TATTAATATA 


TATACGTATA 


TATGTGTGTG 


TGTGTATATA 


TATATGTATA 


TATATATATA 


37260 




TATATACATA 


TATATATACA 


GAGAGAGAGA 


GAGTAGTGAT 


AGGTCTTGCT 


GTCTTGTCCA 


37320 
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GGCTGATCTT 


GAACTCCCGG 


CCTCAAGAGA 


CCCTCCCACC 


TCAGCCTCCC 


AAAGCACTAG 


37380 




GATTATAGGT 


GTAAGCCACA 


GTACCTAGCC 


TATTAAAAAT 


TAATGTTAAA 


CAAGAGGATG 


37440 




TGATGAGGGA 


GTTAGAGGGT 


GTGCCAGCCA 


TGTGTTCCAC 


AGCAGCAGGT 


CAGGAGACAT 


37500 




TGGGGACATT 


TAGAGGAGCT 


GAAGAGGTGG 


CCAACCCTGT 


GCTCAGGAGG 


ACGGGGGAGG 


37560 


5 


GAGAGAGCAA 


GAGGGAGTTT 


GGGCTGGGGC 


AGAACGTACC 


TGGGTCCTGA 


GAGGATAAGA 


37620 




AGGTAGGGAC 


TTGGCCCCTC 


CAGGCCTGAC 


TCTGCCAGCA 


ACCAGCTCCC 


TATCAGCAGA 


37680 




CTCCAGGCCC 


CTACCCTTCA 


GCTCATCCTT 


CCTTATCACA 


CATCCAAAAC 


TCTGAATGTG 


37740 




GCCGGGCGCA 


GTGGCTCACG 


CCTGTAATCC 


CAGAACTTTG 


GGAGGCTGAG 


GCAGGAGGAT 


37800 




CGCTTGAGAA 


CAAGAGTTTG 


AGACCAGCCT 


AGGCAACATG 


GTGAAACCCC 


ATCTCTACTA 


37860 


10 


AAAATATAAA 


AATTAGCTGG 


GTGTGGTGGC 


ACATGCCTGT 


TGCCCCAGCT 


ACTCAGGAGG 


37920 




CTGAGGCAGG 


AGAATCACTT 


GAGCCTGGAA 


GGCGGAAGTT 


GTAGTGAGCA 


GAGATTGTGC 


37980 




CACTGCGTTC 


CAGCCTGGGC 


AACACAGCGA 


GACTCTGTCT 


CAAAAAACAA 


AAACTGGAAT 


38040 




GTGTTTACCA 


TAAAGGCCAG 


AAAATGTGAT 


TAACAGCTGC 


TCAAAGCCCC 


TGTCTGCCCT 


38100 




AAGCCTGAAA 


TTTTCACCGA 


AAAAAAGATC 


TGTAGGCTCA 


TACAGAGGAA 


GGACAAACAC 


38160 


15 


CAGGGAGGCT 


CTCTTCCAGT 


TTGCTTCACC 


TCAGCAAGCA 


GACGGCTGGC 


AGCAATTTGG 


38220 




GGGCAGGTGT 


GAGCACCTGC 


ATCATCAGGA 


AAGAAGGGGC 


ACGGTGGGGA 


CGCAGGTCAG 


38280 




ACCTCTCACA 


GGTCTTGGCT 


CTGCCCAGGA 


GACACGTGTC 


CAACTGAGAG 


GTGAGGAACT 


38340 




GGGTTCTGCA 


GCTGCAGACA 


CAGGTGCGGC 


TCAGCATCTG 


ATGGCCACGG 


AGACCCCCTG 


38400 




GCTTGGCTTC 


TCCCAGCTGG 


TGGCCCATGA 


GGAGCTTCTA 


TCCCAAGAGA 


CTGTCCCTCA 


38460 


20 


AGGAGCAAGT 


GGGACCAGGT 


ACCCACAGGA 


CGGAGCCTGG 


GAGTGAGGCC 


TGCCCTGTGG 


38520 




TCTGGCTACA 


GGGAGGAAGG 


GCAGATTGGA 


GGGGGCAGGA 


CAGCAGGTCA 


GGAATTGGCC 


38580 




AACTCTGGAG 


AGAGCAAGCA 


AGGGGAAGTC 


TGCGCACAGG 


GCAGGGCTGG 


TCAGGGGCGA 


38640 




GGCAGGGCAT 


TGGACCAGTA 


TTTTCAGAGC 


TGGTGAGGCT 


TAAAGAGCAT 


GTCTACTGCC 


38700 




TCTTATTACA 


GAGAGAGGAT 


GCCGAGGCCC 


AGACCCATCC 


AGGCCACCTC 


TCCACAGACA 


38760 


25 


CAGCTGGTGC 


CAGGGAAGCC 


CCTCCCAGAG 


CCTCAAGGCA 


TTGCTCCCTC 


TCTCTCTCTC 


38820 




TTTTTGTTTT 


TTTGGAGACG 


GAGTCTCACT 


CTGTCTCCCA 


GGCTGGAGTG 


CAGTGGTACA 


38880 




ATCTCGGCTC 


ACGGCAAGCT 


CCGCCTCCCG 


GATTCACGCC 


ATTCTCCTGC 


CTCAGCCTCC 


38940 




CGAATAGCTG 


GGACTACAGG 


CGCCCGCCAC 


CACGCCCAGC 


TAATTTTTTG 


TATTTTTAGT 


39000 




AGAGACGGGG 


TTTCACTGTG 


TTAGCCAGGA 


TGGTCTCGAT 


CTCCTGACCT 


TGTGATCCGC 


39060 


30 


CCGTCTCAGC 


CTCCCAAAGT 


GCTGGGATTA 


CAGGTGTGAG 


CCACCGCGCC 


TGGACTTTTT 


39120 




TTTTTTTTTA 


AGACGGGGTC 


TCACTCTGTC 


ACCCAGGCTG 


GAGTGCAGTG 


GCGCGATGTC 


39180 




GGCTCACTGC 


AACCTCTGCC 


TCCCCAGTTC 


AAGTGATTCT 


CCTGCCTCAG 


CCTCCCAAGT 


39240 




AGCTAGAATT 


ACAGGCACAT 


GCCACCATGC 


CCAGCTAATT 


TTCTGTATTT 


TTAGTAGAGA 


39300 




TGAGGTTTCA 


CCATGTTGGC 


CAGGCTGGTC 


TTGAACTCCT 


GACCTCCGGT 


GATCTGCCCA 


39360 
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CCTCAGCCTC 


CCAAAGTGCT 


GGGATGACAG 


GCGTGAGCCC 


CCGCGCCTGG 


CCCCCCGCAG 


39420 




TGCTGGGATT 


ACAGGCGTGA 


GCCCCCGCGC 


CCGGCCCCTC 


CCTCTCTTTG 


ACTCCCTTCT 


39480 




TTCTCACCGC 


CCCCTCCCCA 


CCATCCTTCC 


CCTTCACTGA 


CTTCAGGGAG 


TTAAAAACAA 


39540 




TTCTCGCAGT 


GAGCTGGGCT 


TGTTTTGTCT 


CCCTGCTTCT 


CTTTGTACTA 


AACATTAGAT 


39600 


5 


ACCGAGGAAA 


TGCGGATTGG 


CCTTTGGATG 


ATTCATGAGC 


AGGAGTCAGA 


AAAAGGCACC 


39660 




AGGTTGGCCT 


CAAGCAGCAG 


GGTATAGTAG 


TGCCCGCTCC 


CAGGGTCACA 


CCTCACGCCC 


39720 




ACCCCTCCCG 


CCGTCCAGGT 


GGATGGTGCC 


CACTCCCAGG 


GTCACACCTC 


ACGCCCACCC 


39780 




CTCCCGCCGT 


CCAGGTGGAT 


GGTGCCCACT 


CCCAGGGTCA 


CACCTCACGC 


CCACCCCTCC 


39840 




CGTCGCCCAG 


GTGGATGGTG 


CCCACTCCCA 


GGGTCACACC 


TCACGCCCGC 


CCCTCCCACC 


39900 


10 


CACCCGGGTG 


GATGGTGCCC 


GCTCCCAGGG 


TCACACCTGA 


CGCCCACCCG 


GGTGGATGGT 


39960 




GCCCGCTCCC 


AGGGTCACAC 


CTCACGCCCA 


CCCCTCCCGC 


CCGCCCGGGT 


GGATGGTGCC 


40020 




CGCTCCCAGG 


GTCACACCTC 


ACGCCCACCC 


CTCCCGCCGT 


CCAGGTGGAT 


GGTGCCCACT 


40080 




CCCAGGGTCA 


CACCTCACGC 


CCACCCCTCC 


CGCCGCCCAG 


GTGGATGGTG 


CCCACTCCCA 


40140 




GGGTCACACC 


TCACACCCAC 


CCCTCCCGCC 


CACCCGGGTG 


GATGCCCTTA 


TCAGCTCTCC 


40200 


15 


TTCTCCTTCT 


CTTTCGTCTT 


CTTCGTCTTC 


CTCCTCTTCT 


TTCTTCTTTT 




40260 




TAGAAAGAGT 


TTCTACTCTT 


GCTGCCCAGG 


CTGGAGTGCA 


ATGGCACAAT 


CTCAGCTCAC 


40320 




TGCAACCTCC 


CTCTCCCCGG 


GTCAAGCAAT 


TATCCTGCCT 


CAGTCTCCCA 


GATTGCTGGG 


40380 




ATCACAGGAG 


TGTGTCACCA 


CACCTGGCTA 


ATTTTGTACT 


TTTAGCAGAG 


AGGGGGGATT 


40440 




TCACCATGTT 


GGCCAGGCTA 


GTCTCGAACT 


CTTGACCTCA 


GTTTATCCAC 


CGGCCTCAGC 


40500 


20 


CTCTCAAAGT 


GCTGGGATTA 


CAGGCATGAG 


CCACCCTATC 


TGCCTCACTT 


CTACAGAGGA 


40560 




GGAATGAAGG 


CTCAGAGAGG 


GCAAGCATTC 


CACCCAGCAT 


CACACAGAGT 


GCCGGGTGAG 


40620 




AGCCCAGTCA 


TGAGCCTGGG 


CCTGACTGCA 


GGCTCCTGTT 


GGGAGCTCGC 


GGAGGTGGGG 


40680 




GATCTGTCCA 


GAACTGAGAG 


GCCAGGGGAC 


CACAGTGGCC 


TCTGACCCCT 


GGAGGGCCCT 


40740 




GGAGGCTGCT 


GCCGGCTCCC 


CCCGGGGGCA 


GATGGAGGTC 


ACTGTCACCC 


AGGCTGCTTC 


40800 


25 


TCATGGTGCC 


AGGAGCACAG 


CATGGCAGGA 


GCCACCAGCC 


GATTTGCCTT 


TCCCTGGGCA 


40860 




GGAAACTCAG 


AAATGTGGCT 


ACCACAGTCA 


GGCTGCTTGA 


CGTGCGGTGA 


GCACTCATCT 


40920 




CTTAGCAGGC 


AAGCGGCCAA 


GCACCTTTCC 


TGAAATATTG 


AGGCCTCAGA 


ACAAGCCCCA 


40980 




GGAGAGGTGC 


CAGCACCGTC 


ATCTCTACCC 


AGATAAGGAG 


ACCCAGGTCC 


TGAGAGGTTA 


41040 




GGCAGCTCGG 


ACAACACCAC 


ACAGCTGGAG 


GAGGTCAGAC 


TCTGGGTTGC 


AGAAGGAGAA 


41100 


30 


TGTGAGCAGA 


GGCCACAAAA 


GAGCGAGGAG 


CCAGTGCCCA 


GATGCCGAGA 


TGCCCTCGCC 


41160 




CTCCCAGCTC 


AGCCCCAGGA 


ACCGAGCCCA 


TGGGGAGGGA 


CCGTCAGGGA 


AAGGCTGTCA 


41220 




GGAAGGGCAG 


GAGGCGGCCC 


TGGAGAGGAC 


GGCGCTGCCC 


TCAGGGGCAG 


GAGGGGAGTC 


41280 




CCCTCCGCTG 


AGAGCCCCCC 


CACCCCCAGT 


ATCCCCGGGG 


GTGTCCAGGA 


GGAGGCGGAG 


41340 




GGAGGAAGCG 


CAGATGGACA 


GGACTCCCAG 


ATAGGGTGGG 


GAGGTGTGGC 


CGGTGACACA 


41400 
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CACGGTCCCC TCCTGGCAGG 
GGGGCGGGGT CAGGTCGGGC 
CCCGAGTTGA ACCAGTCAGC 
AAAAGGTGGG AGGGAGTGCG 
5 TTCGTCCTGT CCTCCCAAAC 
GAGCAGAGCG TTTCGGGAAG 
TCCCCTGCGT CCAGAATCCG 
CGCGCGGGAC GGGCCGGGAG 
CGCGGGAGGT GGAGGGGCGG 

10 GCAGTCGCCC CTCCCCCATC 
AGCAGGTGGC GCGGGCGTCG 
GCGCGGTCCC CCTCACTCCG 
CCTCCAGCCC CTGTGCTGCA 
AGAGGGTCCC ACCCTCTCCC 

15 CGCCGGACGC CCCAGGGACC 
CACGCCCGGG TCTGGGAGCC 
CAGGGGCGAG GCGCCCCCGC 
GCAGCCCAGC GCCCCACCAG 
CCCAGGCTTG CAACGCGCAG 

20 CCCAGGCGAG GCCAGACGAG 
CCGGAGCCTA CCGGAAATGG 
TGGAGAGGCG CAAGTGGCGG 
GAGGGGAGAG GAGTGGAGGG 
AGGGAGGGAG ACAGAGGGAG 

25 GGGAGGGTTG GGGGAAGGAG 
AGGGAGGAAG GGAGGGAGGA 
GGCTCTCAGG GGGAGAGGGC 
CCTCCAACCG CTCACAGCGG 
TGGGGTGGGG GTGGAGATGA 

30 GAGGAGCAGG TCCCACCCAT 
AGGCTGGGCT GGCGCCAACA 
CCACAGGGGT GGGCGCTGGC 
TGGGCAGGAT CTGAGCGGAG 
GCCGTGAAAG AGGTGGTTTC 



TGCTGAAGTC ACCTGGAGCC 
ACGCGTGGGT GGGCGGAGTT 
TCGGGAGAGG GACCGCGGCG 
GCTCGTGAAC GGGGGCGGCG 
GTCGAGTGAA AAACGAAGCG 
GGCGGGCCCA GCGTCCTCGC 
CCCCCCGCCC GGGCCTGCGC 
GCCGGGGTGG GCGCTACCTT 
GAGGGGCGGA GCCCTCTGGT 
CAGACCCCGC GGCGCAAAGG 
CAAAGGGTGG TCCCCGAGGC 
GGCTCCGCCG TGTCTGGCCC 
CTGGCGCGGG GAGCGCCGGG 
CGCCTCCCCA CGAAGGCTCT 
CCGCCCGCAC ATCGCGAGCG 
ACCCTGCGGC AGTCGCGCCC 
CCGACGTCCC GGTCCCGAGC 
CCCCGCCGGC GCCGCAGACC 
GGTAGGAGAA GGGAAATTGG 
GCCTCTGCTC AGATCCCGCC 
TGCTGGCCAT GGTGCTGGCG 
CCGGAGCTGC AGACGGCTGG 
AGCGAGGGCG GGCGGGAGGC 
AGAGAGAGAG GGTTGGGGGA 
AGAGAGAGAG AGAGAGACTG 
AGAGAGAGAG GAGCAAGCGC 
GGAGGAGGGG GGCTACCCGA 
GACAGCTGCT TCTCCAACTT 
GCCCCCATTC CCCAGGGCAG 
GGGGTGGGGC CGCAGAGCTC 
GGGTCCAGGT CTGTGCCTGC 
CTGGAGGCCT CCAAGGGGCA 
AAGAGTGAAA GTGGAGGAGT 
CCGCCTGGGC TCAGACCTTC 



TCCAAGCCCG 


TGGGGCCTGA 


41460 


CTGCGCCCCG 


GGCCAAGGCG 


41520 


ACCTGTCCCG 


GGGGCGTAAG 


41580 


ATGGGAAGGA 


GGTGCGGCCC 


41640 


GGTTCTGCGG 


CCTCGCGGCG 


41700 


GCCCGAGGTC 


GCCCGGCAGC 


41760 


CCGCCCCTCC 


GCCTGAGCTC 


41820 


CGAAGGCGGT 


GGGTCCGCCC 


41880 


CTCCGGAGGG 


TTTGGGGATC 


41940 


GCAGTGGCTT 


TTCTGGCCAG 


42000 


CGCAGCGGTG 


TGGGGGGAGG 


42060 


GCCCCCCTCC 


TTCAGCGCCC 


42120 


TTCCCGGCTG 


GGGCTTTGGC 


42180 


GGCGGACCCA 


GATCTCGGGT 


42240 


CGCCCACCCG 


GTCGCGAGCC 


42300 


TGCGTGGCAC 


GCTGCTCCCC 


42360 


GCTCCCCGGC 


GCGGCGCCTC 


42420 


CCAGCCTCGG 


GCGGGTCGGG 


42480 


CGTCCGCTGC 


CGGCCGCTGC 


42540 


GCCCCACAAA 


GCCCGTGGCC 


42600 


GCGGTTGGGC 


CTGCGGAGGC 


42660 


TGCTGCAGTG 


CCGGGGAGGG 


42720 


GGGCGCGGCG 


GGAGAGAGAG 


42780 


AGGAGCGGGG 


GGAGGAGGGA 


42840 


CGGGGGCGGG 


GGAAGGAGGG 


42900 


CTGGCTGCGG 


AAGGGGCCGC 


42960 


ACTGCAACAA 


GACCCCCCAC 


43020 


GGCTTTGTGA 


GGCCTGAGAG 


43080 


GCGGGGCAGG 


GGCAATGCCG 


43140 


TTCGCCGCCA 


AGGCCGCTGT 


43200 


CATCGGAGAG 


GATGCCACAG 


43260 


TCTCCTGTGA 


GCCCAGGGGA 


43320 


GAGGCCAGAA 


CAAAGGCTTT 


43380 


ACTCACTGTG 


TGGCCCAGGC 


43440 



-42- 



WO 01/42434 



PCT/US00/33065 





CAAGGGCAAG 


CGTCTGACCT 


CGCTGGGCCT 


TTGTTTCTCA 


GGGGTAAGAT 


GAAACAATGA 


43500 




TGCCCCCAGA 


CGATGGAGAG 


GAGGGGTGCC 


AGGGTTGTGC 


GCACTTAGTG 


AGTGGGGGGC 


43560 




AACCTATCCT 


GCCTCCCCCT 


CTCCTCATAA 


CTCCCAAAGG 


GAAAGCCTGG 


TAGGCAAACG 


43620 




GAGCGTCTTT 


GCCATTGCAG 


GGATGAAGCC 


ACCGAGGCAG 


GGAGAAAAGT 


GCTTTGCCCT 


43680 


5 


ACAAGCAACT 


AAGTCATAGG 


GCCAGGAGCA 


AAACCCTGAA 


AACCTCAGGA 


GACTTGCAGA 


43740 




GCCATGAGGC 


TGGCTCAGCA 


ACACAAAAGC 


CAGGGGCAAG 


CCTCAGCTCT 


AGCAGTGCGG 


43800 




TGGGAGCACC 


CAAGGCCAGT 


CACATCCTAG 


GGTGGCCTGG 


AGAGTCCTGA 


CCCCTGACGT 


43860 




GCAAGCCGGC 


ATCATCCCCG 


GGACTGTGAG 


TCTGGTGGGG 


GTGATGCCCA 


GGAATGTGAC 


43920 




ATTGTGTGGC 


CCAGAGGTAC 


CCTTAAGACT 


GGAGGATCAC 


CAGGCGGGCC 


CTGACCTCAT 


43980 


10 


CACAGGAGCC 


CTTTAAAAGC 


AGTTTCCTTT 


GCCTGGTTGA 


AGAAATCGGA 


GGGATCAAAC 


44040 




CAAAGAAGGT 


TTTCTGTTGT 


TGAGATGAGG 


GGGCCACGTG 


GCAAGGATCT 


GAGAACTGCT 


44100 




CCCAGCCAAC 


AGCCAGCAAG 


ACAACAAGAC 


CTTAACTGCA 


AGGAAGTGAG 


TTCTGCCAAC 


44160 




AAGAAGAGAA 


TGGGCTTGGA 


GGCAGGTTTG 


ACCCCAGGGC 


CTCCACACAA 


GAACTGAGCC 


44220 




CAACTGCCCA 


CTTGGTTTCA 


GCCTTGGGTT 


ACTAAGAATT 


AGGAGGTAAT 


GAATGAGAGT 


44280 


15 


TGTTTTAAGC 


TGTTGGTTTT 


GTGGTGATTT 


GCTATGAAGC 


CATATCAAAC 


TAATATACAC 


44340 




ACAGAGGTGT 


TGGCCCCTGG 


GCCATTCCTA 


GGAAGCCAGC 


TCTGCGAAGG 


AGGAAGAAGG 


44400 




GCAGAGAGGC 


ACACAGAGCT 


GCCCACCACA 


GCAGCTGTGT 


CCTCCCTGTT 


GGCCACCACA 


44460 




GTAGCAGTTG 


GGGATGGTCA 


GCATCCTTCA 


GGCAGACTCC 


AGCCCCGGGT 


GCTGGAGCTC 


44520 




AGGTGCTAGG 


GATCAAGAGA 


AGTAGCCCTC 


TCTGGGACCT 


CCAGAGTCTT 


CTCATGTGGG 


44580 


20 


TGGGGTAGGA 


CCCACCCAGT 


CAGGCTCAGA 


GCACCGCAAT 


GCCTCACACT 


CATTGTGACT 


44640 




CTGGCCAGGC 


CCTCTCTGAG 


CCTCTGTGTC 


CTCATCTGGA 


GCACAGGGAC 


CAGGTGTGTG 


44700 




GAAGCCCGTG 


GCATAGTGCC 


AGGAACACAG 


TAGATGTGCA 


CAGTGTGCAC 


TAGCAGGAAC 


44760 




ACACAACAGG 


GGTACTGACT 


GTCAGCACCT 


AGGCAGGCAC 


ACGCAATGGG 


GTACTGACTG 


44820 




TCAGCCATAC 


TGACTGTCAG 


CGTGCTAGCA 


GGCATACACA 


ACAGCTGTAC 


TGACAGCACA 


44880 


25 


CTAGCAGGCA 


CATGCCATAG 


GTGTACTGAC 


TCTCAGTGCA 


CTGGCAGGCA 


CACGCAATAG 


44940 




GAGTAATGAC 


AGCATGCTGG 


CAGGCACACA 


ATAGCTGTAC 


TGACTGTTTG 


CCCCAATATA 


45000 




GTGCCAGGTC 


TTGGAGCAGA 


TTTTGACTTC 


TCACCAAGAT 


CAAATGCAGA 


AAGTGCACGA 


45060 




GCATTTCAAA 


GATGTTTTTC 


ACATGCACAT 


TAGTGCTAGT 


TAAAAAAATG 


TTTTGACTGG 


45120 




GTGCAGTGGC 


TCACAACTGT 


AATCCCAACA 


CTTTGGGGGG 


CCGAGGTGGG 


CAGATCACCT 


45180 


30 


GAGGTCAGGA 


GTTTGAGACC 


AGCCTGGCCA 


ACATGGTGAA 


ACCCCATCTA 


CCCTAAAAAT 


45240 




ACAAAAATTA 


GCCAGGTGTG 


GTGGCAGGTG 


CCTGTAATCT 


CAGCTACTTT 


GGAGGCTGAA 


45300 




GCAGGAGAAT 


CACTTGAATC 


CAGGAGGCAG 


AGGTTGCAGT 


GAGCCGAGAT 


CCCACCACTG 


45360 




CACTCCAGCC 


TGGGCAACAA 


TATCAAGACT 


CCACCTCAAA 


AAAAAAAATG 


TTTTTCATAA 


45420 




AGTGTGACTT 


TTATCAGACC 


TCTGCATTCT 


TGAAATTAAC 


TCTGGCTTGG 


CTGGGCGTGG 


45480 
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TGGCCCACAC 


CTGTAATCTT 


AACACTTTGG 


GAGGCTGAGG 


TGGGCAGATC 


ACGAGGTCAG 


45540 




GAGTTCAAGA 


CCAGCCTGAC 


CAACATGATG 


AAACCCCATC 


TCTACTAAAA 


ATACAAAAAT 


45600 




TAGCCGGGCG 


TGGTGGCATG 


CACCTGTAAT 


CCCAGCTACT 


CAGGAGGCTG 


AGGCAGGAGA 


45660 




ATCGCTTGAA 


CCCAGGAGGT 


GGAGGTTGCA 


GGGAGCCGAG 


ATCGCACCAC 


TCTATTCCAG 


45720 


5 


CCTGGGCGAC 


AGAGCAAGAC 


TCTGTCTCAA 


AAAAAAAAAA 


GAAAGAAAGA 


AATTAACTCT 


45780 




GGCTCCTAGA 


AGGAGCCCTA 


TATCTCAGCA 


GGACACTCAG 


TCATTCAACA 


GACATCTGTC 


45840 




AAGCACCTGC 


TGTATGCTGG 


AGCTGTGGGT 


ACGTCAGCAA 


TTAGAGGAAG 


AGGGCAGGGG 


45900 




TACAGGAGTT 


CCTGACCACC 


CCAGGCCAGC 


ACGCTCCTAT 


AGCAGCTGGC 


AAGGAGCAGA 


45960 




TGACTCAGAC 


TTCAGCTCAG 


TCCACAGGAC 


AGCCTTTTCT 


GGCCACTGCT 


CTCAGGAGAT 


46020 


10 


GAGATGTGTG 


GCTGCAAAAG 


GTAAACTCCT 


GGCTCCTGAG 


CAGGCTCTGG 


GCAATCTGCT 


46080 




CAACGCTCTG 


TGCCTCACTT 


TCTCACCCAG 


AAAGTGTGGA 


CAATGAGAGG 


ACTTATCTGG 


46140 




CTGGGCGCGG 


TGGCTCACGC 


CTGTAATCCC 


AGCACTTTGG 


GAGGCCGAGG 


CGGGTGGATC 


46200 




ACCTGAGGTC 


AGGAGTTCAA 


GACCTGCCTG 


GCCAACACGG 


TCAAACTCCA 


TCTCTACTAA 


46260 




AAATATAAAA 


AATTAGCCGG 


GCTTAGTGGT 


GCACACCTGT 


AATCCCAGCT 


ACTTGAGAGG 


46320 


15 


CTGAGGCAGG 


AGAATCACTT 


GAACCCAGGA 


GGTGGAGGTT 


GCAGTGAGCC 


AAGATTGTGC 


46380 




CACTGCACTC 


CAGCCTGGGC 


AAAAAGCCAA 


AACTCTGTCT 


CAAAGAAAAA 


AGAATCATGG 


46440 




CAGAAGGTGA 


AGTCTATGTT 


AGTCCCAGTT 


CCCAGGTCGT 


ACATGGCGGC 


AGGAGAAAGA 


46500 




GAGAGAGAAG 


GGGAAACTGC 


CACTTTTAAA 


CCATCGGGTC 


TCCTGAGCAC 


TCACTGTCAG 


46560 




AACAGCCTGG 


AGGAAACTGA 


CCGCATGATC 


CAACCACCTC 


CCTCCAGGTC 


CCTCCCTCCA 


46620 


20 


CACGTGGGGA 


TTACAATTCG 


AGGTGAGACT 


TGGGTGGAGA 


CACAGAGCCG 


AACCATATCA 


46680 




GCATGTATGG 


GGGGCACTGA 


AACTTGTGCT 


TGGTGCCCAT 


TCATTCAACG 


AGTGTGTGTG 


46740 




GCTGGTCTCC 


TCATCTTCAA 


CTCCCTGCCG 


AGTCTCAGAT 


AGGCAGCCTG 


CAGTTCCTTC 


46800 




ACCACAACAG 


GCACATGGGG 


CTGGGTGCCA 


GTGAGTGCTG 


GGGCTTCTCC 


GAGCACTATC 


46860 




TCACACCCAG 


GAGCGTGGGC 


ACGCATGGCA 


TTCGCATGTG 


CCGTCAGTGG 


ACATTAAACA 


46920 


25 


CAGCCATGAA 


GAAGCCACGA 


AGAAGTGCTG 


CCTGCCGGCC 


GTGCGCGGTC 


ACGCAGCGCC 


46980 




AACTCCCTCC 


TGGGGCCTTC 


TGGGGCCTTC 


TGGGGCATGG 


GAGCTGGGGC 


CGCCTGAGAC 


47040 




AAACATCCGT 


GACGCTGGGC 


TGACCCCACA 


GAACGGTGCG 


GGCCTCGCTC 


TTGGAGTCAG 


47100 




CCCTGCTGCC 


AGCCAGTGCC 


GGGTGCTGGG 


GACTCAGGGA 


GGCCCGCCGG 


GACCACTGCG 


47160 




GGACAGTGAG 


CCGAGCAGAA 


GCTGGAACGC 


AGGAGAGGAA 


GGAGAGGGGG 


CGGTCAGGGC 


47220 


30 


TCTCAGGAGC 


CGGGTCCTGG 


GCAAGGCGCA 


GCCGTTTTCA 


AATTTTCAGG 


AAAGCGGTCG 


47280 




GCTCACACTC 


GAGCAGTAAA 


AAGATGCCTC 


TGGGGAGGAG 


GCCCGTGCAG 


CTCTCCGGGC 


47340 




AATGGTGGTG 


GCTCGGCCTA 


GAGAGGCGGT 


AGTGGAACGC 


AGACCCTGGT 


GGGGGAATGA 


47400 




CATCAAGGGA 


GGAGACGGGC 


GGGACCCCAG 


ATTTCTGCCT 


GTGGGCGATG 


GAAGTGAGGT 


47460 




TCACTGGCCA 


GCGGAGCCGG 


ACACAGAACG 


CGCAAAACGC 


CGTGTAGGCC 


TGGAGGAGCC 


47520 
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GAAGAGCAGG 


CGGACCCCCT 


CCGCGGGGGA 


ACAGTTTCCG 


CCGGGAGCAC 


AAAGCAACGG 


47580 




ACCGGAAGTG 


GGGGGCGGAA 


GTGCAGTGGG 


CTCAGCGCCG 


ACTGCGCGCC 


TCTGCCCGCG 


47640 




AAAACTCTGA 


GCTGGCTGAC 


AGCTGGGGAC 


GGGTGGCGGC 


CCTCGACTGG 


AGTCGGTTGA 


47700 




GTTCCTGAGG 


GACCCCGGTT 


CTGGAAGGTT 


CGCCGCGGAG 


ACAAGTGAGC 


AGTGAGTCGC 


47760 


5 


AGTGACCCTA 


CAAGTGGTTC 


TTTTACCCGA 


GCGGCTCGTA 


GGCGCGTTGC 


GGTTTTTCGA 


47820 




AACTACAGCT 


CCCGGCAGGC 


CCCAAGCCGC 


CCTCGGGGCC 


GCGGGTCGGC 


GGATTGGCCG 


47880 




CGCTGCATTT 


TGGGACCTGT 


AGTTTCCTGC 


GCTCGTGGCG 


CTGGCGCCGC 


GGCGTTGGCT 


47940 




GAGCCCTTGA 


CCGGGGCTGG 


AGGGAAGGGC 


CGACATTCAG 


TGTGTCCGCG 


TCTGTTCTGT 


48000 




TAGTCCCAGT 


TCCCGGGCGG 


GATTGAGGCT 


TAGAGAAGTT 


GAGTGATTTG 


CTGAGGGCTG 


48060 


10 


CACGGGTTGG 


CATCCCGGCA 


TGCTCTTTCG 


CTACTTTGGC 


TGCATCTGGT 


TGCCCACCCG 


48120 




GGCGGATGGG 


GAATGGACTC 


CAGCCAGCCA 


GGAGGGCAGA 


GGGCTGGAGA 


GGCAGGGCCG 


48180 




GAGGTTCAGA 


CCCTCCGCTC 


TGACGTTGCG 


CCTGGTGAGG 


CCGGGAGGGG 


TGCCGCTTGC 


48240 




CTCTTCAGCC 


CTCACGCTCT 


TGTGGAAGTC 


GCGGAATTAC 


TGCAGGCGGA 


ACTTGCAGCA 


48300 




CTGTGGGCGT 


CTTTTCCAGA 


GAAGGACGGA 


GTTGTGGGGC 


GGGAGGATAA 


GGCAAGGCCC 


48360 


15 


AGCCACTTCG 


CATCTTCGCC 


CCGCCAGCTC 


CTCGAGATGG 


GATATACCAG 


GGTTGCTCTC 


48420 




CAACCCTCTC 


CGCAGGAGGG 


ACTGATGGAA 


ACGCCTGGGA 


AAGTAGCCCG 


GTACCCACAA 


48480 




AGGCTGTCTA 


CAAACAGAGT 


CTTACTGTCT 


TTCCCAGGTC 


TGTGCCATAG 


GGATTCTCGA 


48540 




AGAGAACAGC 


GTTGTGTCCC 


AGTGCACATG 


CTCGCATCGC 


TTACCAGGAG 


TGCCCGAGAC 


48600 




CCTAAGATGT 


TCGGAGTGGT 


TTTTTCGCAC 


AGACCCGAAT 


AGCCTGCCCC 


TCAGCCACGC 


48660 


20 


TCTGTGCCCT 


TCTGAGAACA 


GGCTGATATG 


CCCAAGATAG 


TCCTGAATGG 


TGTGACCGTA 


48720 




GACTTCCCTT 


TCCAGCCCTA 


CAAATGCCAA 


CAGGAGTACA 


TGACCAAGGT 


CCTGGAATGT 


48780 




CTGCAGCAGG 


TAGAGCACAG 


GCCCCGAGGA 


AAGGACTGCG 


GGTGGGTGGA 


GCTTCAGCCA 


48840 




GGACGGGGTG 


TGCTTCCCTC 


TCCCGGCCCA 


TTCCAGCCAG 


GCCCCTCCGG 


GCCAGAGGCA 


48900 




GCGTCTGTCA 


TAAAAAGGGC 


TGGTGTTCCA 


GGTGGGGTCA 


GAGAGAGGAT 


TGACAAGTAA 


48960 


25 


AAACGATCGT 


CCTTTGAAGG 


GGGCCGGCCC 


CTCCACACCT 


GTGGGTATTT 


CTCATCAGGC 


49020 




GGGACGAGAG 


ACTGAGAAAA 


TGAATAAGAC 


ACAGAGACAA 


AGTATAGAGA 


GAAAAGTGGG 


49080 




CCCAGGGGAC 


CGGCGCTCAG 


CATACAGAGG 


ACCTGCACCG 


GCACCAGTCT 


CTGAGTTTCC 


49140 




TCAGTATTCA 


TTAATTACTA 


TTTTCACTAT 


CTCAGCAAGA 


GGAATGCGGC 


AGGACAGCAA 


49200 




GGTGATAGTG 


GGGAGAAGGT 


CAGCAAGAAA 


ACGTGAGCAA 


AGGAATCTGG 


GTCACAAATA 


49260 


30 


AGTTCAAGGG 


AAGGTACTAT 


GCCTGGATGT 


GCACGTAGGC 


TAGTTTTATG 


CTTTTCTCCA 


49320 




CCCAAACATC 


TCGGTGGAGT 


AAAGAGTAAC 


AGAGCAGCAT 


TGCTGCCAAT 


ATGTCTCGCC 


49380 




TCCTGCCACA 


GGGCGGCTTT 


TCTCCTATCT 


CAGAATTGAA 


CAAATGTACA 


ATCGGGTTTT 


49440 




ATACCGAAAC 


ATTCAGTTCC 


CAGGGGCAGG 


CAGGAGACAG 


TGGCCTTCCT 


CTATCTCGAC 


49500 




TGCAAGAGGC 


TTTCCTCTTT 


TACTAATCCT 


CAGCACAGAC 


CCTTCACGGG 


TGTTGGGCTG 


49560 
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GGGGACTGTC 


AGGTCTTTCC 


CATCCCACGA 


GGCCATATTT 


CAGACTATCA 


CATGGAGAGA 


49620 




AACCTTGGGC 


AATACCCGGC 


TTTCCAGGGC 


AGAGGTCCCT 


GCGGCTTTCC 


GCAGTGCATC 


49680 




GTGCCCCTGG 


TTTATCGAGA 


CTGGAGAATG 


GCGATGACTT 


TTACCAAGCA 


TACTGCCTGT 


49740 




AAACATATTG 


TTAACAAGGC 


ATGTTCTGCA 


CAGCTCTAGA 


TCCCTTAAAC 


CTTGATTCCA 


49800 


5 


TACAACACAT 


GTTTCTGTGA 


GCTCAAGGCT 


GGGGCAAAGT 


TACAGATTAA 


CAGCATCTTA 


49860 




GGGCAAAGCA 


ATTGTTCAGG 


GTACAGGTCA 


AAATGGAGTG 


TGTTATGTCT 


TCCCTTTCTA 


49920 




CATAGACACA 


GTAACAGTCT 


GATCTCTCTT 


TTCCCTACAG 


TCCTTGAGGG 


TGACAGACTT 


49980 




AGGAGTGCCT 


TGGGGGCCTC 


TCTGAGGAGC 


AGCTGATATT 


CACGGGTCAG 


GAGGAAGCAT 


50040 




TTCCATTAGA 


GGGGCAGCCG 


GTGGCCAGCC 


TCACTTGGAA 


GGTCTTTGAA 


CCTCGGGGGT 


50100 


10 


GCAGGGAGGT 


GGCAGTGGTG 


CAGGTTGCCT 


TCTCCTGGGT 


TCCTTGAGGT 


GCCCTCTTGT 


50160 




ACCCGGCTCA 


CACCCTTCCC 


CTCCCCGAGT 


TTCCTGCTCA 


GGTTCCCGTC 


TGAGAGCTTG 


50220 




TATGTAGGAC 


GTCAGATAGG 


ACAGCATAAA 


TGTTTGGATC 


CAGAAACGCA 


GAACAGTTTC 


50280 




CTATTTTGAG 


ACTTGACACC 


TAATTAGTCA 


TCTTACTATT 


TAAGCTGAAA 


AATAGTGTCG 


50340 




TGTTTTGGGT 


AACGTTCTGC 


AAATCGTTTG 


CTAATGGCGG 


CTGAGTTGCT 


TCACGCCCTT 


50400 


15 


TAGGGCAAGA 


GTGGGACTTG 


CCTGTGGACT 


TCTCCGCGGT 


CCCACAGGGC 


TCTCGCCACC 


50460 




TGGCAGTGGC 


CTCTGCATCT 


GCAAAGAGCT 


GCCCGCTGGC 


TGCCGAAGCT 


TGTCTCAGGG 


50520 




CAGCTTGTGT 


GGCCTCGCCT 


CTTCCTGGCT 


TCCCCGTAAC 


CCTTGCTCCG 


AACTCCGTTC 


50580 




AGAAGGTGAA 


TGGCATCCTG 


GAGAGCCCTA 


CGGGTACAGG 


GAAGACGCTG 


TGCCTGCTGT 


50640 




GCACCACGCT 


GGCCTGGCGA 


GAACACCTCC 


GAGACGGCAT 


CTCTGCCCGC 


AAGATTGCCG 


50700 


20 


AGAGGGCGCA 


AGGAGAGCTT 


TTCCCGGATC 


GGGCCTTGTC 


ATCCTGGGGC 


AACGCTGCTG 


50760 




CTGCTGCTGG 


AGACCCCATA 


GGTGACCCTA 


GTTCCCAGGC 


CTCTCCTGGC 


CTCCTGTGGG 


50820 




GATGGTTGGC 


AAGGGATGGC 


GCTGAGGGTG 


GGGTGGGCCC 


ATGGGGACTC 


CTGCCGTCTC 


50880 




TCAAGCAGAA 


CTCAAGGAGA 


ATTTTTTAGC 


TGCTGTATAA 


TTTCTCGCCA 


TCGTGGGTGT 


50940 




AAACCTAGGG 


TTGGGCTTTT 


TTGCTGAATT 


AGGGCACGGC 


AGATGCCCAC 


TTCACCCATT 


51000 


25 


TTTGATAAAC 


CAGTATCTGG 


GGTGTCAGAT 


TCTTGGCTGT 


CTGCAGGGCC 


GAGTTAGCCG 


51060 




AATGCCACCT 


GCCTTTGATA 


CGTGAGAACG 


TTGTCTGAGA 


ACCGTGACTT 


CTGTGCTTGC 


51120 




TTGTGTCTGG 


TCAGCTTGCT 


ACACGGACAT 


CCCAAAGATT 


ATTTACGCCT 


CCAGGACCCA 


51180 




CTCGCAACTC 


ACACAGGTCA 


TCAACGAGCT 


TCGGAACACC 


TCCTACCGGT 


GGGTCAGACG 


51240 




AGTTTACACC 


TGTCTCGGGG 


TCCTCAAGAG 


AACCAGCTTG 


GCATGGTGCT 


GAGTCCACAG 


51300 


30 


CCCCATGCTG 


TGCTGTGGTG 


GAGGGTGGTG 


GTCTTTCTAG 


ACGCTCCCCC 


GAAGTGTGCA 


51360 




GAGCGCTGGT 


GCCCAGGGGT 


GGGGTGCGGC 


CTGGGCTGCC 


TCCAATGCCC 


ATTACTTGTG 


51420 




AGGAAGCAGC 


TTTGCATCTG 


TGTGCTGACC 


TTGGGCGGGC 


GTCCTGAGCT 


CCTCGCAGGT 


51480 




GCTGTTGTAG 


CAGCTGTGCA 


GTAGGTCAGG 


GCTGGCCCCC 


AGTGCAGCTT 


TGCACATGAA 


51540 




GTAGGAGGAG 


GCCCTGCTGC 


TTGTCAGAGC 


CCAGCAGAGT 


CTTGGTGTTC 


TGTCGGGTTC 


51600 
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CTGTGGCCGG 


ACCAGTGGCA 


GGGTGCTGTG 


GAAGCTGTCG 


AATCTCCTCC 


CTCTGTCCAG 


51660 




TACCCCCGCT 


CGTCTTCTAG 


CTCCCTCCTA 


CGCCCGGGCC 


ACGTTTCAGT 


TATGCTCACT 


51720 




TCCTCTGACC 


GCCGAGGCTC 


CTGCGTGTCT 


CCATACAGCT 


CACGCTGCAG 


GGCCACGCTG 


51780 




TGGGTGTTGG 


AGACAGCTCC 


TCCTCGACCC 


ACGGTGCTCT 


CTCCCACCAG 


GCCTAAGGTG 


51840 


5 


TGTGTGCTGG 


GCTCCCGGGA 


GCAGCTGTGC 


ATCCATCCTG 


AGGTGAAGAA 


ACAAGAGAGT 


51900 




AACCATCTAC 


AGGTAGGCTC 


CTGGGCTCCC 


GCTCCGGCTC 


AGTGTCCGAC 


AGGCGAGTGC 


51960 




TGCTGGGTGT 


CCAGAGCCCC 


AGGCTGCGCT 


CCCGCTGGGC 


TAGGGTTTGA 


AGTTCACTGG 


52020 




GGGACTGCAG 


GGGAGGACCT 


GGTGGGGGTG 


GGGACTGGCT 


TCGGTCCTTT 


CTTGGCCGTG 


52080 




CTTCAGCTGC 


GCACTCTGCC 


CTTCCTCCCA 


CAGATCCACT 


TGTGCCGTAA 


GAAGGTGGCA 


52140 


10 


AGTCGCTCCT 


GTCATTTCTA 


CAACAACGTA 


GAAGGTACAA 


GCAGCTGGGT 


GGGACCAGGG 


52200 




TCGGGTTGGA 


GTGTGTGCAG 


CCTCTCAGGG 


TGGAGCTCAG 


TGGTGTCACA 


GCCTGGTTGT 


52260 




GCTTGCCCGG 


TGGGGCGGCC 


AGTGCGGCCA 


TGTACCTGGG 


CCCTGTCTTC 


TGACTCGGGG 


52320 




CCACCCATGT 


TAGACTTCTG 


TGTGGAAGAG 


CTCACACAGT 


GGTCTGAGAC 


AGCCAGCCGG 


52380 




CAAGACTGCC 


TCTGGCTGGT 


GCCTGGGGCC 


TTGGATTTTG 


GGAAGGCTCC 


CTCCATTTCC 


52440 


15 


TGATGAGAGG 


GTCTCCCTGC 


ACCTAACCTG 


CTGGTGCAAA 


CAGTAGGGGT 


TTTGCTGAAC 


52500 




ACCGGCTTTC 


TCTTCGGGGA 


CTTTGTTGCT 


TGCCCAGCAG 


CAGGTGCTCC 


AGTGACCGGC 


52560 




CCTCATACCA 


TCTTGGGAGG 


GTGTCCTGGA 


AGCCGTGTCT 


GGCCTCCCGC 


GACCCTGCCC 


52620 




CGTGTGTCTT 


TTTCCTGTGC 


TGACCTTGCT 


GCGGAAAATT 


ATGGCCCTGA 


GTGTGACTCC 


52680 




AGGCTGAGTC 


CTGTGGGTCC 


AACACGGGAT 


GCCTTGGGGC 


CTCTTCTGGA 


GACGGGATGT 


52740 


20 


GAGTGACAGG 


AGCCGGCCGG 


GGCAGCTTGC 


CCTGTGACTG 


CACGTGGCCA 


CAGCCTGTGA 


52800 




GGGCCGGGGG 


TGCTTCTCCA 


CCCACGTGGC 


TGCCCCTCGG 


GTATGTCAAG 


GGCTTCTGGG 


52860 




GCTCATCACG 


GGGTCCTAGA 


GACAGTGGCA 


GGGTGCACCC 


CCGTTGGCTG 


CCCTTACAGT 


52920 




TTCTGTGACC 


TGAGGGTGGC 


ATCTGTGCAG 


TCGGCGCGGT 


CTGTGCTTCT 


GTGGGATCAG 


52980 




GGTTCCCTCT 


GTTTCCTGCC 


TCAGTTGGGG 


CTCAAGCCTC 


AGGTGAGGTG 


GCCCCGGAGC 


53040 


25 


ACTCAGAAGG 


CATCGGCGGT 


CCTGTGGGCT 


GCTTTCTGCA 


CTCACGTTTG 


CTGAGTGCTC 


53100 




AGTGTGCCAG 


GACTGAGGAC 


CCTGAAGCTG 


CTCTTGTATT 


TAGGGCGGCG 


CTCCCCTGGC 


53160 




AGAGACTGAG 


CCAGGTGGTC 


CCGCATGACC 


CACTACCAGG 


CGTTTCTGGG 


CCCTGGCCCT 


53220 




TGGAGGGACA 


GGGTGGGCGG 


AACATGGGCC 


TGCAGGGAGG 


CTCCCGCTTA 


CTGGAGGCAT 


53280 




GTGCTGTGTT 


GCTGGAGACA 


TCCTCTGTGT 


TGCTTCTTGT 


TCGCTGTGGT 


TTTTGGTCTG 


53340 


30 


GTGGCACCAA 


GGACCCTCAG 


TCATCTTGAT 


GTGTGGTTGT 


CCAGGCCTTT 


TTGTTGGTCC 


53400 




TAAGAAGGGG 


CTCTGCCTTT 


GTGCCCCCAG 


GTTCCCTGAC 


AGGAGCTGCC 


GGCTCGTCCC 


53460 




GGTGATGCCT 


GCAGGACGTG 


ACTCTGGGAC 


GGGGGGTTGG 


GCAGATGTGC 


TGATGGAAAT 


53520 




TCTCAAGCAG 


GCGTCATTTC 


CGAGGTCCTC 


ACCTGGATTT 


CCAGGACAGG AGTGCCTGCT 


53580 




GGGTGTCCCC 


AGTCCCATGC 


AGCGGGGGTC 


CTTGGGATAG 


CATGGAACGC 


TGAGCATGGG 


53640 
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CCTGGCCGGC CGTGGTCCTG 
GGTGGGGACG CTGGGCCTGG 
CCTCTGACTG CCTCCTGGTG 
CATCTGTCTC CTCCTCCAGG 
5 CTGTCCCCCT TGGCTTGGGC 
CCTTCGCACC CTGTCTGCCA 
GCCTGAGCAT CTTCAGAGTG 
GTATTTTGGG CGAATGAGGA 
TGAGGACTCT GGTTGGGGAC 

10 GCCCCGCCCC TCGTGCCCCT 
CGCCAGGCAA GGTTTGGCAA 
TGCTATGGTG CTGCCCAGGG 
GTATCTGGGG TTGGGTCATC 
GTGCATATGT GTGCGCTTCT 

15 CCTCCTGGGG ACAGGCACAA 
GCCCCACGTG GCTGCCCTGT 
TGTGTTTAAG CAACTCCCTA 
GAGGCAAGGG AGGGGCAGGA 
GGCCTCTCCG CCAGTACAGG 

20 GGGCCACTGT TTAGCCCAGG 
GTGGTTGGCA GGGAAGCTGA 
GGAGATGCCT CTGTGGGTTG 
GGGCTGAGTG TAGCCGCCAC 
ACCCTCGGGA TCCTGCTGGC 

25 AGAGGAGGAT TCCCTCAGGA 
AGTGGGTGCT TTGTGGCGCG 
AGGAGCCGTT CTGTCCTGGG 
TGTGTTTACC CCTGCCTCAC 
ATCCTGGACA TTGAGGACTT 

30 AGGCCACGAC CACTGTCCTT 
TATTTCTTCA CTTTTCTTTG 
AAAAGTTTAA GCAGCTCTGA 
TCAGCCATCC CCGGTGGCCA 
CGGAAGCACG CGCAGTTTTG 



GACAAGGGCA GTGCCCCGGT 
TACCTGGTGG GGATGCTGGG 
CTGCTTCCGT CTGTGTTAGG 
CCTGTGGACT CAGACCAGGA 
TCTCACTGCC CGACCTGGCG 
CCGGACAGGC TGTGAGGGGG 
GGCTGCAGCT CCTGGAGGGG 
GACAGCTGGA GAGCTGGCAC 
AGCAAGCTTG GGGTCAGCCT 
TCCCCTCGCA GCTCCTGTCC 
GTGCCGCTGT GCGGCAGTGC 
GTGTGCTTTT CCTCCCCTGC 
GCTGGTGTGT GTGAGTGTGT 
GGCCTCTGCA GCTGAGTCCT 
AGCAGCCATG ATGGAGTCGG 
GACTCTGGGG TGCTTGTTAG 
AGGAATTCTT GTGGTTCCAG 
CATCCCCCAG ACTCTGACTT 
CAGTGTCCTT TGCCAGGGCT 
AAAACCTGGC TCTCCCTTAG 
GTGACGGTGC TAATCACAGG 
GGGCGATAGG CTGAGGGGCT 
TCCTGTCCTG TCTTGGGCTG 
CTCCGTCTGG TCCACCCTGA 
TCACTCGGGT GGGGGCCTCT 
TTCACGGTTG GTGGGGGACG 
CATCCCCCTG TGGTCTGGGA 
ACCTGCAGAA AAAAGCCTGG 
GGTCAAGAGC GGAAGCAAGC 
CCATGGCCCA GCTCTCCTGT 
TTCCTTTTTA AATTATGAAA 
GGAAGATAGA GTAAAAAATT 
CCGTGGAGTG TGGACGGAGC 
TCTGCACAGA CTGTCCTGCA 



GGCTGCTGGG 


CCTGGGACCT 


53700 


CCTGGGACCT 


GGTGGGGAGG 


53760 


CCTCTGGGTA 


TTGGGGCCCC 


53820 


AGACACAGGC 


CAGCCCCTGC 


53880 


GGAGGTTGCC 


TAGCCGTGAA 


53940 


TGTCTGCAGC 


ACCTGCACCG 


54000 


TCTGAGAGGA 


AGGGAGGCAG 


54060 


CCTTCCTGGC 


CTGCGTCCTG 


54120 


GGGGCAGAGC 


CTCTGGGACG 


54180 


TCGCCCCGCC 


CTCAGCTCTC 


54240 


CTGCTGATTG 


GCTGGTCTGT 


54300 


CTTCCCTGCT 


ATCCCTGGGA 


54360 


GTGTGTGTGT 


ATGTGCACGT 


54420 


GGCCCTCGGG 


GGGCCTGGCA 


54480 


GAGCTGGGGG 


AGGCCCCATT 


54540 


AAGAGGTATC 


TGGTTCTGTC 


54600 


TTTGGGGGGC 


CTGTACTGTA 


54660 


CTGAAGCCTT 


TTCTGCCCGG 


54720 


GCCATGCTGC 


AGAGGGGAGT 


54780 


CTGGAAGTTC 


TGGGCCTGTT 


54840 


GGCACCTGCA 


GGGGTTTGTG 


54900 


GTTCTTCCCT 


GCCCTGAGGA 


54960 


TCTCGGAGAG 


GATGCGTAGA 


55020 


ACCTCAGGCC 


TTCTGGGGGC 


55080 


CTTGGGCACC 


TGAGACCCTC 


55140 


CCCAGCCCTG 


CCCGCCGTGT 


55200 


CTTAGTGGAC 


CCTGAGGGTG 


55260 


AGCAGGAGCT 


GGCCAGCCCC 


55320 


ACAGGTGAGA 


CCCCTCAGTG 


55380 


GACCTGTGGA 


GGCCCGGATA 


55440 


CTAACCACCA 


TTCAGTACGA 


55500 


GTCTCCCTCT 


TCCCTGGCCC 


55560 


CCTGCAGGCC 


TGTGTCTGTG 


55620 


GTTGGCTGTT 


TTCACTCAGC 


55680 
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GTTGTGGGTA 


TAGCTTCCCA 


TGCTGGTGCT 


GGCAGCTCGG 


CCTTGTTCTT 


TTGAGGACAG 


55740 




CAGATGTCTC 


CTATGTCTAC 


CTCTTACAGC 


TTCAGAGATT 


CAAGTTATAA 


TAAAGCTCTT 


55800 




CTTATATTGA 


GGGGGAAACC 


TCCCTCCCCC 


TTTTTTTTGA 


AACAGGGTCT 


CGCTCTGCTA 


55860 




CCCAGGCTGC 


AGTGCAGTGT 


CACAGTCTTG 


GCTCACTGCA 


GCCTCAGCCT 


CCCAGGCTCA 


55920 


5 


AGCGATTTTC 


CCACCTCAGC 


CTCCCAAGTA 


GCCGGGACTG 


CAGGCACGCA 


CCACCATGCC 


55980 




TGGTTAATTT 


TTGTATTTTT 


TGTACAGACA 


GGGTCTCACT 


CTGTTGCTCA 


GGCCAGTCTC 


56040 




CTGAGCTCGA 


GAGTTCCACC 


TGCCTTGGCC 


TCCCAAAGTG 


CTGGGATTAC 


AGGCGTGAGA 


56100 




CCCCATGCCT 


GGCCAGCTCT 


TTTTTTTTTT 


tttttttttt 


TTGAGACGGA 


GTCTCGCTCT 


56160 




GTCGCCCAGG 


CTGGAGTGCA 


GTGGTGCGAT 


CTCGGCTCAC 


TGCAAGCTCC 


GCCTCCCGAG 


56220 


10 


TTCACGCCAT 


TCTCCTGCCT 


CAGCCTCCCG 


AGTAGCTGGG 


ACTACAGGTG 


CCCGCCACCA 


56280 




CGTCTGGCTA 


ATTTTCTGTA 


TTTTTAGTAG 


AGACGGGGTT 


TCACCGTGTT 


AGCCAGGATG 


56340 




GTCTCGATCT 


TCTGACCTTG 


TGATCCGCCC 


ACCTCGGCCT 


CCCAAAGTGC 


TGGGATTACA 


56400 




GGAGTGAGCC 


ACCGCGCCCG 


GCCCAGCTCT 


GCTTTTTCTT 


AGTGGTTCTG 


CGTTGTGTTT 


56460 




GTTTCTATCC 


AGGAATAGGG 


TTGGTTTTAC 


TTTTCCATCG 


AGTTTTTAAA 


GAGACGACGA 


56520 


15 


TTTACATGGT 


CGGAAACTCA 


CGAGGACTCC 


CCATCCCTTG 


GTCGGAAACT 


CACATGGACT 


56580 




CCCCATCCCT 


TGGTCAGAAA 


CTCACGTGGA 


CTCCCATCCA 


TCCCAGGCAG 


CAGCTTCCCA 


56640 




CCTGGGCCCT 


ACGTGCAGGA 


TGAGGGCTCC 


TTCCGGGTCA 


GAAGACATGG 


CGGCCTCGGG 


56700 




GCACCGTCCC 


CTGCATGGGG 


TGCTCACAGG 


ATCTTCTCCT 


CTCTCCTTCC 


CAGGGTGTGC 


56760 




CCTTACTACC 


TGTCCCGGAA 


CCTGAAGCAG 


CAAGCCGACA 


TCATATTCAT 


GCCGTACAAT 


56820 


20 


TACTTGTTGG 


ATGCCAAGGT 


GGGGGCTCAG 


TCCTGTAGCT 


GACGACTCCT 


GATGTCCAGG 


56880 




GGTGTCCCTG 


GGCTTGGGAA 


CAGCTGTCCG 


AGCCTTTGCT 


GCTTCAGGGC 


CTTAGATCAG 


56940 




CAGGCCTGGG 


TGGGAGGACT 


CACCTCTGTC 


ACTGGGCAGG 


GGCTCAACCT 


GGCCAGACAC 


57000 




ACTTGTGAGC 


AGCCCCAGGC 


CACAGGTCAG 


TTTTCTGAGC 


AGTCTGGGAG 


CGGGCAGGCT 


57060 




GGTGGGAGTG 


AGGAGAGACC 


TCCAGGCTGT 


GGTCCATAGG 


CCAGTGCCCG 


CTCTTGATCC 


57120 


25 


TGACAGCTCA 


GGTTCTCTCC 


TTCACGTCAG 


GCCATGGGAG 


GCACCGAGAA 


CACAGGAAGC 


57180 




CCACTGACTC 


CCCTCTTCCC 


AGCGCGTGCC 


CGGCCCCACA 


CTCACTCCCC 


CTCCCAGCAT 


57240 




GTGCCCGGCT 


TCACACTCAC 


TCCCCTCTTC 


CCAGTGCATG 


CCCGGCCCCA 


CACTCACTCC 


57300 




CCCCACAGCA 


TGTGCCCGGC 


CTGACACTCA 


CTCCCCTCCT 


CCCAGTGTGT 


GCCCAGCCCC 


57360 




ACTCCCTTCC 


GCCCCGTGTG 


CCCAGCCCCA 


CGCTCACTCC 


CCCCGCCAGC 


ATGTGCCCGG 


57420 


30 


CCCCACACTC 


AACTCCCCTC 


CTCCCAGTGT 


GTGCCCGGCC 


CTGCTGCCCT 


CCTCCCCATG 


57480 




TGCCCTGCTT 


TTGTGCCCCA 


CACTTTTTAC 


TTAGTGCAGG 


TGGGATCACA 


CGCCACGGGT 


57540 




CAATGGTTTG 


TGTGTTCACG 


TGACGATGGC 


GTGGTGACGT 


TTCCAGATCC 


CGTCGTTGGT 


57600 




TCGCTCATTC 


TCGGGGTGTA 


TATTTATTGA 


GAGCTCATCA 


TGCTGGGTGC 


TATTCCAGGC 


57660 




ATAGCAAGAC 


TGGCTTCACT 


CACATGGAGC 


TTTGATTCTA 


GTGGTGGGGA 


CAGGTGGACA 


57720 
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GCAAAAGAGT AAGCACGTGA GCTGACGATA 
GGAGACCGAG CCAAGCGGGC CCAAGTGCGA 
TCTGAGGGGA GCCTCAGCAG GTGCAGCAGA 
GGGGCCGACT CCTGGGAAGC TGTAGCAGAA 
5 GTTGTGGGTG ACTCGGTGCT TTGAGCCCTG 
TAACCCAACC AGGCCCCTCC CTGGGACAGT 
CTTTCACGGA AACGCAGAAG ATCTAAAACA 
GAGGACTCAC CTCGCTGGTT TCAAGACTCC 
GCCCAGCTCA GGCACAGGCC TGCAGGCCAT 

10 ACGCATGTGG TCATTGTTTT TTTTTTCAGT 
TGCAGTGGCA CCATCTCGGC TCACTGCAGC 
CATCAGCCTC CCGAGTAGCT GGGATTACAG 
TTGTTTTTTG ACAAACATGC CAATTTAATT 
TTCTGAAAAA ATGGTGCTAA GAACAGCTGG 

15 TCTTGTTTTA CCCTGTATAA ACCTAAATGT 
CATGGGGGAG GTCTTTGCAA CTTTGGGGTA 
GCACTAGCCA TGAAGAAAAA CATTAAAATT 
CTGTGGAAGA GTTGAGAAAA TGAAAAAGCA 
AAGGACTTAA AAAATTTTTT CAGCCCTCCT 

20 GTAAAATTCT CCCATAACTA AGCAAACAAC 
GACGTTTCAC CAAGATGGGT GGAATGGCCA 
GTTTCGCCAA GATGGGTGGA ATGGCCAGTT 
TCGCCAAGAT GGGTGGAATG GCCAGTTAAC 
CCAAGATGGG TGGAATGGCC AGTTAACCAC 

25 AGATGGGTGG AATGGCCAGT TAACCACTGG 
TGGGTGGAAT GGCCAGTTAA CCACTGGGAG 
GTGGAATGGC CAGTTAACCA CTGGGAGAGC 
GAATGGCCAG TTAACCACTG GGAGAGCATC 
TGGCCAGTTA ACCACTGGGA GAGCATCCGG 

30 CCAGTTAACC ACTGGGAGAG CATCCGGACA 
GTTAACCACT GGGAGAGCAT CCGGACAGAC 
GAGCACATGG AAAGTCGCCC AGCATCTCCA 
GGAGCCGACA CTGTGGTCCC ACTGGCATGG 
AGGATGTGGA ACAGCTGGAT CTCATCCATC 
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CTGAAGGGAA 


ATAGAGCAGA 


GGGAGGAGGC 


57780 


TGTCGGCGGG 


AGGTGGGGAA 


TGCTGGTGGG 


57840 


GCAAGGGAAG 


AGGTGAGTGG 


GGGCGGCTGG 


57900 


CCCCACAGAG 


AGCTGGTGAG 


GTTTGCCGTG 


57960 


GCTGCCCCTG 


GGAACCATCT 


GGAGAGCTTC 


58020 


TATATCACAG 


CTGGTAAGCC 


GAGTCTAACA 


58080 


GCAAGATGAC 


CGTGAAGAAG 


AACAGAGCTG 


58140 


TCTAAAGCTG 


CAGGAGTGGA 


GGTGGAGATG 


58200 


GGAGAAGGCA 


GCAAGCTCAA 


GCTGACCCAC 


58260 


TGGAATCTCA 


CTCTGTCACC 


CAGGTTGGAG 


58320 


CCCCGCCCCT 


AGGTTCTAGC 


GATTCTCCCA 


58380 


GCGTGCGCCA 


CCATGCCTGG 


CCCTTGGTGA 


58440 


GAGAGAGGAA ATGAAGGTTG 


ATTTCTGGTT 


58500 


ATATCTGTTC 


GGAAAACAGT 


GAATCTTAAC 


58560 


AAAAGCTAAA 


CTAAAAGTTA 


TAGAAAGGAA 


58620 


GGCAGAGATT 


TCTTAGTATG 


GATACACAAG 


58680 


TAGACTTCAC 


CAAAATTTAA 


AGCTTCAACT 


58740 


GTTAAAGAAA 


GGGAGAAAAT 


ACTTCTTTCA 


58800 


CTGATTTGAA 


AGGACCTTTG 


ACCAGAGTAT 


58860 


CCACTTAACC 


ACTGGGAAGG 


GATCTGGACA 


58920 


GTTAACCACT 


GGGAGAGCAT 


CCGGACAGAC 


58980 


AACCACTGGG 


AGAGCATCCG 


GACAGACGTT 


59040 


CACTGGGAGA 


GCATCCGGAC 


AGACGTTTCG 


59100 


TGGGAGAGCA 


TCCGGACAGA 


CGTTTCGCCA 


59160 


GAGAGCATCC 


GGACAGACGT 


TTCGCCAAGA 


59220 


AGCATCCGGA 


CAGACGTTTC 


GCCAAGATGG 


59280 


ATCCGGACAG 


ACGTTTCGCC 


AAGATGGGTG 


59340 


CGGACAGACG 


TTTCGCCAAG 


ATGGGTGGAA 


59400 


ACAGACGTTT 


CGCCAAGATG 


GGTGGAATGG 


59460 


GACGTTTCGC 


CAAGATGGGT 


GGAATGGCCA 


59520 


GTTTCACCAA 


GGTGGATGGA 


ATGACCAGTT 


59580 


GTCATAGGAG 


AAGGCAGATT 


AAAGCCACGG 


59640 


CTGAAATTCA 


GAAGCCCTGA 


GTGTGGCATG 


59700 


GCTGTGAAGT 


TGTGTAGCCA 


CTCCACAAAC 


59760 
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GTGTGGCAAA 


CAGCCGAGCC 


GGGAGAAGGG 


AAGACGTGTT 


CAAAGATTCA 


TATGTGGCCA 


59820 




GGCTCAGTGG 


CTCACGCCTG 


TAATCCCAGA 


ACTTTAGGGG 


CCAAGGCTGG 


GGGATCGCTT 


59880 




AAGCCCAGGA 


GTTTGAGACC 


AGCCTAGGCA 


ACATAGGGAG 


ACCCCATCTC 


AAAAAAAAAA 


59940 




AAAAAGAAAA 


AAGAAAAGAC 


TTCAGTGTGC 


AGGTTTACCA 


GAGTTTTGTT 


TGCAGTTGCC 


60000 


5 


AAAACTGGGA 


AGCAGCCCGC 


GTGAGCCCAT 


CCACAGGTGA 


ATGGACAGAC 


CGTGGTACCC 


60060 




GAACACTAAC 


AGCAGCCACG 


GGCGTGGACT 


GTGGTCACAC 


AGCAGCAGGG 


AGCCGATGAG 


60120 




TCTCGGACAT 


GCTAACCCAG 


AGAGGCCCAT 


TGAGGAGGAC 


CTACTGTTTT 


TTGTGTTTTT 


60180 




GTTTTTTGTT 


TTGAAATGGA 


GTCTCGCTCT 


GTGGTGCAGG 


CTGGAGTGCA 


GTGGTGTGGT 


60240 




CTTGGCTCAC 


TGCAGCTTCC 


GCCTCTTGGG 


TTCAAACAGT 


TCTCCTGCCT 


CAGCCTTCCG- 


60300 


10 


AGTAGCTGGG 


ACTACAGGCA 


CCCGCCACCA 


CACCCGGCTA 


ATTTTTGTAT 


TTTCAGTAGA 


60360 




GACGGCAGTT 


CGCCATGTTG 


GCCAGGCTGG 


TCCCAAACTC 


CTGACCTTGT 


CATCCACTCA 


60420 




CTTTGGCCTC 


CCAAAGTGCT 


GAGGTTGCAG 


GCATGAACCA 


CCGCACCCGG 


CTGGACCTAC 


60480 




TGTTTTATTC 


CATTTATGTG 


ACACTCTATT 


AATAGAAAAG 


GCAGGGGTGG 


GGCTGGTGGT 


60540 




TATATGGTGC 


ACATAACTGC 


CAGAACTCAG 


TACACTTAAA 


ATGAACATCT 


TAATGTGTGA 


60600 


15 


AATTTTTTTT 


TTTGAGACGG 


GGTCTTGCTC 


TGTCACCCAG 


GCTAGAGTGC 


AGTGGTGCGA 


60660 




TCTCCACTCA 


CTGCAAGCTC 


TGCCTCCTGG 


GTTCACGCCA 


TTCTCCTGCC 


TCAGCCTCCC 


60720 




GAGTAGCTGG 


GACTACAGGC 


GCCCGCCACC 


ACGCCTGGCT 


AATTTTTTTT 


TTTTTTTTGT 


60780 




ATTTTTAGTA 


GAGACGGGGT 


TTCACAGTGT 


TCGCCAGGCT 


GGTCTCGATC 


TCCTGACCTC 


60840 




GTGATCCGCC 


TGCCTCGGCC 


TCCGAAAGTG 


CTGGGCTTGC 


AGGCGTGAGC 


CACCATGCCC 


60900 


20 


GGCCAATGTG 


TGAAAATTTA 


AAAGTACCAA 


AGCTGGACCC 


CACCCCAGAT 


TGCTCCCATG 


60960 




ACACTCTGTG 


GGTGGGACCT 


GGGAGTTGGG 


TTTTGTTTTG 


TTTTGTTTTG 


TTTTTGAGAT 


61020 




GAAGTCTCAC 


TCTGTCGCCT 


AGGCTGGAGT 


GCAGTGACAC 


AATCTCGGCT 


CACATTAACC 


61080 




TCTGCCTCCC 


AGATGAAAGC 


GATTCTCCTG 


CCTCAGCCTT 


CTGAGTAGCT 


GGGATTACAG 


61140 




GCACACACCA 


CCACCCCCTG 


CTAATTTTTG 


TATTTTTAGT 


AGAGACGGGG 


TTTTACCATG 


61200 


25 


TTGGCCAGGC 


TGGTCTTGAA 


CTCCTGACCT 


CGTGATCCGC 


CCGCCTCGGC 


CTCCCAAAGT 


61260 




GCTGGGATTA 


CAGGCGTGAG 


CCACCGCGCC 


TGGCTGGGAG 


TTGGGTTTGT 


AAATCTCCCT 


61320 




GAGTGGGGCT 


GGGGCAGGGA 


ACTGCTGGGT 


CTGGGTCTTC 


CTGGCTCCTC 


TGGTCTGTGG 


61380 




CTTCCTGACT 


GCGGTGGCCG 


GGGGCTCCCA 


GGGCATCGTG 


GCCGTCTGTC 


TTGCTGAGCG 


61440 




TGGCACGTGC 


CTTTCCATGC 


TGTGGAGGAG 


CGTCTCCCGG 


TATGGCGAAC 


TGCTGGTTAG 


61500 


30 


GGTGGGGCGG 


TGTTGCCAGG 


TCATCCAGGT 


CTGGCCTCTG 


CTCTCGACAT 


CGCCGGCGCT 


61560 




GTTGCTCATC 


TGCGCTTGTG 


ATGTTCGATG 


CCTGCTGCAC 


ATGTCTTGGC 


TTCCCTCTTT 


61620 




CCCGGCCTCT 


GTGAGCTCCA 


GCGCTGCGTC 


CCTTCTCTTC 


CTCCTGTAGA 


GCCGCAGAGC 


61680 




ACACAACATT 


GACCTGAAGG 


GGACAGTCGT 


GATCTTTGAC 


GAAGCTCACA 


ACGTGGTGAG 


61740 




TCTCCGCTGG 


CCTCCTAAAC 


ACCTCCTATT 


GCTTCTGGCC 


TTTTTGTCAA 


GAGCCACGCA 


61800 
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AACCTTTCTG 


GAGGGGCTCT 


GGCCAAACTC 


CTGAAGCCCT 


AGGTGCCCAG 


GACTGGGGAC 


61860 




TGAGCACACC 


AGGAGCTTCT 


GCCACCCCCT 


CCCGCCCTGA 


TCCGATGCCT 


CTGCTGGGGC 


61920 




TGGAGACTGG 


CCAGCTGGGC 


CAGGGACCTG 


CCCGTCAGGC 


GCAGGGCCCC 


CACAGGCCGC 


61980 




TCACCAGACC 


CTTTCCCTCC 


AGCCAGCTCG 


GGGTCAGCCT 


GGGCCAGGGC 


TGTCTCCTCT 


62040 


5 


GCCCTCGGCA 


GCAGCAGGCT 


TGTGGTCTTG 


CCTGCAGTGT 


CTCTGCCCTT 


CCGGCCACAT 


62100 




GGCTTGAGAC 


TGAGGCAGGA 


GAATCGCTTG 


AACCTTGGAG 


GCAGAGGCTG 


CAGTGAGCCA 


62160 




GGATCACACC 


ACTGCATTCC 


AGCCTGGGTG 


ACAAAGCGGG 


ATTCTGTGTC 


AAAAAAAAAA 


62220 




ATGTTGACTG 


GGCGCGCTAG 


CTCATGCCTA 


TAATCCCAGC 


ACTTTGGGAG 


GCTGAGGTGG 


62280 




GCGGATCACG 


AGGTCAAGAG 


ATCAAGACCA 


TCCTGGCCAA 


CATAGTGAAA 


CACCGTCTCT 


62340 


10 


ACTAAAAATA 


CAAAAAAATT 


AGCTGGGCGT 


GGTGGCGTGT 


GCCTATAGTC 


CCAGCTACTC 


62400 




AGGAGGCTGA 


GGCAGGAGAA 


TCACTCGAAC 


CCAGGAGGTA 


GAGGTTGCAA 


TGAGCCAAGA 


62460 




TCACACCACT 


GTACTCCAGC 


CTGGTGACAG 


AGCAAGACTC 


CGTCTCAAAA 


AAAATAAAAT 


62520 




CAAAAAGAAT 


AATTGGCAAT 


TCCAGTGAAA 


TAATTGTTTG 


TTTGTTTGTT 


GAGACAGGGT 


62580 




CTCCTTCTGT 


CGTCCAGGCT 


GGAGTTCAGT 


GGTATGATCT 


TGGCCCACTG 


CAACCTCCAC 


62640 


15 


CTCCTGGGCT 


CAAGCCATCC 


TCCCACCTCA 


GCCTCCCGAG 


TAGCCGGGAC 


TACAGGTGCA 


62700 




CACCACCACG 


CCCGGCTAAT 


TTTTGTATTT 


TTTGTAGAGG 


CGGGGTTTCC 


CAGCGTTGCC 


62760 




CAGGCTGGTC 


TTGAACCCCT 


GAGCTCAAGT 


GATCTGCCCA 


CCTTGGCCTC 


CCAAAGTGCT 


62820 




GGGATTACAG 


GTGTGAGCCA 


CCGCGCCCGG 


CCTGAAACAA 


TCGTTTCTAA 


ATATTGGTGT 


62880 




GGGCCACACA 


GTCATGTTTG 


GACCTACTTG 


TGGCCTTTTA 


CAGACCCCAG 


GCCAAGGCTT 


62940 


20 


TGGGAACTTG 


GCTGTCAGCC 


TCCTGTGCCT 


TCTGCACCCC 


CACCCCATTT 


CTGCTTTCTG 


63000 




GAACCCCCGA 


TCCTGTCCTG 


TTCTGTGGTG 


ATTCGGGTGT 


GCTTGGGCTC 


TAGGAGAAGA 


63060 




TGTGTGAAGA 


ATCGGCATCC 


TTTGACCTGA 


CTCCCCATGA 


CCTGGCTTCA 


GGACTGGACG 


63120 




TCATAGACCA 


GGTGCTGGAG 


GAGCAGACCA 


AGGCAGCGCA 


GCAGGGTGAG 


CCCCACCCGG 


63180 




AGTTCAGCGC 


GGACTCCCCC 


AGCCCAGGTG 


CGTTCATAGC 


CAGACTGCTT 


GGTCCTGAGG 


63240 


25 


CCTGCGCTGC 


TGCAGGGTGA 


GCCCCACCCG 


GAGTTCAGCA 


CGGACTCCCC 


CAGCCCAGGT 


63300 




GCGTTCATAG 


CCAGGCTGCT 


TGGTCCTGAG 


GCCCGTGCTA 


CTGCAGTGGG 


CAGCCTGCCC 


63360 




TGTGGCTGTG 


TGTGGTCGGC 


CTGGGCACCA 


TCTATTCAGG 


CTGGCACTGC 


AGGGCATCCG 


63420 




CTTCTCTCAG 


AGGCTTCTTG 


GGTGTGAATT 


CTTCAGGGTC 


CTGTAGCCTG 


TGGAAGGGCT 


63480 




GGTATTGTTC 


AGTAGTTCTG 


GTATTTTCCA 


AAGACCTATG 


TCTTCTCCCA 


GCCAGTATCA 


63540 


30 


ACTTGGCCTC 


TACTGTGTAA 


AACTGGAAAA 


CTCTACTTTG 


TGAAGCTGAG 


TTGGGAGCAT 


63600 




CGCTTGAGGC 


CAGGAGTTTG 


AGACCAGCCT 


GGGCAACATG 


GCGGAACCTC 


GCCCCTGCCA 


63660 




AAAAATTAGC 


CAGGTGTGGT 


GGTGTGCTCC 


TGTGGTCCAA 


GCTTTTCTGG 


AGGCCGAAGT 


63720 




GGGAGGCGTG 


CTTGAGCCTG 


GGAGGCAGAG 


CTTCCGGTGC 


CCCAGATGAC 


TCCACTGCAC 


63780 




TCCAGCCTGG 


GCGGCAGAGT 


GAGGCCATCT 


CAAAAAAAAA 


AAAAAGGAAA 


ACTAAATATA 


63840 
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TTCACTGTAA GGGCATTTTG 
TCTGCCTGTA GGTTCCTTCC 
ACGGCATATG TGGAAAACGT 
TTTCTTTTTT TTTTTTTTGA 
5 GCGCGATCTT GGCTCACTGC 
CCTCCCAAGT AGCTGGGATC 
AGTAGAGATG GTTTCTCCGT 
CACCCGCTTC GGCCTCCCAA 
TGTCCATATT TTCTACATGG 

10 CATAGGTTCT GCTGTGACAC 
TTCCCCCATT TTCCTTTAAA 
GGGGAGCCCC GTCCTCACCC 
AAGTTGGCAT TTGTTTAGTA 
TATATATATA TATACATATA 

15 GTGGTGGTAC AATCAAAGTT 
TCTCAGCGTC TTAAAGCGCC 
TGCCACGCAG GGAGTTGCAG 
CTCCTCTCCA CAGTCCCCCA 
TCCTTCGCTT GGCGCAGGGT 

20 CCATTTCATC GCTTACATAA 
ATCACCACGG GGCCTTCTCC 
TGTTGTAACT AGTGGGGGAC 
AACTTCAGTT TTCATCCCTA 
CACCCAGGCT GGAGAGCGGT 

25 CTCCACCTCT TGGGCTCAAG 
GGCGTGTGCC ACTGTGCTTG 
GCCCAGGCCG TTTACCCTGC 
CCCGGTTTAC TCTGAACTTA 
CCGGGCTCCA GATGGGACGT 

30 CTTCCCATGT GAACATTGTT 
AAGATGGCAG TGTGCTAATT 
CCCTGCACCA TATCAGCTAT 
GGGGGTGGGC CGGGTGTGCT 
AGGCGGATCA CGAGGTCAGA 



CATC TTT AAA TGACCCACAA 
CAGTGTTTGT CCAGAGGTGT 
GGAAACCCTT CATGGATGTT 
GATGGAGTTT CACTTTTGTT 
AACCTCCGCC TCCTGGGTTC 
ACAGGCGTGC ACCACCACGC 
GTTGGCCAGG CTGGTCTCGA 
AGTGCTGGGA TTACAGGCGT 
CTTCTGTAAA CAGCTGACTA 
TACTTGCTCG TGAGCATCTC 
ATCGCACATG TGGACGGACA 
TTCTCACCAG GATGGCTGCT 
CAGAAGTTAT CAGGTGTTCT 
TTTAAGTGAC AGGGTCTCAC 
CCCTGTAGCC TCGGCCTCCT 
GGGACCACAG GTGTGCACCA 
TGGGGGAAGG GGTTCCTGGG 
ACCCCACCTC TCTAACGGGG 
GGGGAGAGTG ACAGGTCTCC 
CGTGGGAGAA ACATCCACCC 
ACTCTTCAGT TTTGTTAGTT 
GTGTGGTGGG GTGGGCTTCT 
TCTGTTCCCC CACCCCTTTG 
GGTGCCATCA CGGCTCACTG 
TGATCCTCCT GCCTCGGCCT 
GCAGGGTCCA TTCTTTTCCT 
AGAGTCGGAA TCTGTACAGG 
GGTGGCTTGA GGGCCCCAGT 
CCTTTCTATC AGAAGGCTCA 
GCTGCTGAAC ACCTGAATAT 
CCATCATCTA GTCAGTTAGC 
GTGGTGATCC CATTCACACA 
GTCTCACACC TGTCATCCCA 
GATTGAGACC ATCCTGGCCA 



ATCTGGCATG 


CATCAGCTGC 


63900 


ATTTCCACAC 


AGCGCTAGTC 


63960 


GTCAGTTGGT 


CTATATTTTC 


64020 


GCCCAGGCTG 


GAGTGCAATG 


64080 


AAGCAATTCT 


CCTGCCTCAG 


64140 


CCAGCTAATT 


TTGTATTTTT 


64200 


ACTCCTGACC 


TCACGTGATC 


64260 


GAGCCGCCAC 


GCCCGGCCTT 


64320 


GGAGTCTGTG 


TGAATATCTT 


64380 


CAGGTGTAAA 


CAGCATCAGC 


64440 


CCACGGGGAC 


CCTGGACCCT 


64500 


TGGTAGAGAG 


TGAGTTTGCA 


64560 


GGCTTTAGAA 


TCCCTTTATA 


64620 


TCTGTTGCCC 


AGGCTGGAAT 


64680 


GGGCTCATGG 


GATCTTCCCG 


64740 


CTGCCACCGG 


CTCTCAAGAT 


64800 


ACTTTGAACG 


CTCCACCTCC 


64860 


TGGACGGCCG 


CCTCTTTCCA 


64920 


TTCCCTCATC 


TCGGCAGCTG 


64980 


ACCCCCAGGC 


CTGTGTGAAC 


65040 


ACTTGATGTG 


CAGGGCTTTT 


65100 


GCCATCTCAT 


TCAGGACCAG 


65160 


GAGATGGGGT 


CTCACTCTGT 


65220 


CAGCCTCCAC 


CTCCTGCAGC 


65280 


CCCAAGCTCC 


TGGGACTACA 


65340 


CACACTTTAT 


TTATTGAAGA 


65400 


AGGGGCAGCC 


ACACGAGTTC 


65460 


TAGACTGCGG 


CCACCGTTTG 


65520 


CAGTATCTCC 


TTTCCCGTTT 


65580 


GTTAATCACT 


GGGGGCTTGC 


65640 


AGGAATAACT 


TAGGACCACG 


65700 


GGAAAGGTGG 


GACAAATGCT 


65760 


GCACTTTGGG 


AGGCCCAGGC 


65820 


ACACGGTGAA 


ACCCCGTCTC 


65880 
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TACTAAAAAT ACAAAAAAAT TAGCCAGGTG 
TGGGAGGCTG AGGCAGGAGA ATCACTTGAA 
ATCGCACCAT TGCACTCCAG CCTGGCAACA 
TCAATCAAGT GTCATCACTG AATGTTTGTG 
5 CTCCCTCCTG AATCTCACTC CTGACCTCAG 
TCTTGTGTCC TGACCCCGGC GGTTGTGGCC 
CTCATGTCCT GCTCCAGACC TGGAATCAGC 
CTGCAAGACG GTATTTCAAG ACCCGCCGTG 
GGTCGTTGTG ACTGGGTGTT TTCGTGGAGA 

10 TGTTAATTCT AAAGGTCAAA CACCCGGCCA 
TTTGGGAGAC CAAGACTGGT GGATCACCTG 
CAGGGTGAAA CCCCATCTCT ACTAAAAATG 
CTATAGTCCC AGCTAGTCGG GAGACAGACA 
TTGCAGTGAG CAGAGATGGC GCTGCTGCAC 

15 CTCAAAAATG AATAGATAAA TAAAGATAAA 
GATTTGGCCT GTGTAGCCTC TTCCTTCGCC 
TGTGGCCTCT TCCTTCCCCT GTTGGTGGAT 
CCTTCCCCTG TTGGTGGATT TGGCCTGCAC 
TGGTGGATTT GGCCTGCACG GATTCTGTGT 

20 GCCTGCACGG ATTCTGTGTG GCCTCTTCCT 
TTCTGTGTGG CCTCTTCCTT CCCATGTTGG 
CTCTTCCTTT CCATGTTGGT GTCCTTTTTT 
GGGTTGTTGG CACGAGCGTG ATGCAGACTG 
AACATGGAGC TGGAAGACAT TGCAAAGCTG 

25 GCAGCTGGGT GGGGCCTCCT CCTTGCGAGG 
ATCCTGCTGC GCCTGGAGGG GGCCATCGAT 
GTCACCAAGC CAGGGAGGTG AGAGGCGGGG 
GAGCTAGAAA CGGGCCATGG TGCAGTCCTG 
CAGGCCTGTT TTCCCTTTTT GAGACCTGGG 

30 TGAGATGTTG AGAGGAGCAG CACACACTCC 
GGCACACACA CCCATGGAAC AGCACACACA 
ACAGCACACA CACTCCCACG GAACAGCACA 
GAACAGCACA CACACTCCCA CGGAACAGCA 
CGGAACAGCA CACTCTCCCA CGGAACAGCA 



PCIYUSOO/33065 



TGGTGGTGCA 


TGCTTGTAAT 


CCCAGCTACT 


65940 


CCCAGGAGGC 


GGAGGTTGCA 


GTGAGCCGAG 


66000 


GAGCGAGACT 


CCGTCTCAAA 


AATCAATCAG 


66060 


TGTGAACGTG 


GGGATTGGTC 


CTGCCCCATG 


66120 


TTGCTGCACC 


TTGAGGTGTT 


TTCTGTGGGC 


66180 


TCTGCTGTCT 


GGGAGTCAGG 


ATTTTTCACA 


66240 


CAAGTCTCCA 


AGAAGCCCTG 


CTTTCTTTTC 


66300 


CGGCAGCGGG 


TTGGTCATGG 


TTACTGGGTT 


66360 


TACAGCCATA 


CGCACAGGTG 


TGTTCACAAA 


66420 


GGCATAAGGG 


CTCAGCGGTA 


ATCCCAGCAC 


66480 


AGGTCAGGAG 


TTTAAGACCA 


GCCTGAGCAA 


66540 


CGAAAATTAG 


CCGGGCATGG 


TGGCGCACAC 


66600 


CGAGAATTGC 


TTGAACCTGG 


GACATGGAGG 


66660 


CCCTGCCTGG 


GTGACAGAGT 


GACACCCTGT 


66720 


ACACCTGCTC 


CTCTTGGTGT 


CTCCAGTTTG 


66780 


TGTTGGTGGA 


TTTGGCCTGC 


ACGGATTCTG 


66840 


TTGGCCTGCA 


CGGATTCTGT 


GTGGCCTCTT 


66900 


GGATTCTGTG 


TGGCCTCTTC 


CTTCCCCTGT 


66960 


GGCCTCTTCC 


TTCCCCTGTT 


GGTGGATTTG 


67020 


TCCCCTGTTG 


GTGGATTTGG 


CCTGCACGGA 


67080 


TGGATTTGGC 


CTGCATGGAT 


TCTGTGTGGC 


67140 


CCATGCCAGG 


AATCCTGGTT 


CTCAAGGGCG 


67200 


CCTTTGCTGC 


CTTTCTCTTG 


CCCAGGGCTG 


67260 


AAGAGTAAGT 


GTTGCCCTCC 


CCGCCTCCTT 


67320 


AGGTGGGTGA 


CACCTCCTCG 


ACCCACAGTG 


67380 


GCTGTTGAGC 


TGCCTGGAGA 


CGACAGCGGT 


67440 


AGCCAGCCCC 


TTCACTGCAG 


GCCCAGCCTA 


67500 


GGCTGTCACA 


TCACGAGTGA 


GGCCTGTTTT 


67560 


AGGAGCACCT 


GCTTTGCATG 


ATCTGGTTGC 


67620 


CACGGGACAG 


CACACAGCCC 


CCCACGGAAC 


67680 


CTCCCACGAA 


CAGCACACAC 


ACTCCCACGA 


67740 


CACACCCACG 


GAACGGCACA 


CACACCCACG 


67800 


CACACACCCA 


CGGAACGGCA 


CACACTCCCA 


67860 


CACTCTCCCA 


CGGAACAGCA 


CACACACTCC 


67920 
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CACGGAACAG 


CACACACACC 


CACGGAACGG 


CACACACTCC 


CACGGAACAG 


CAGACTCTCC 


67980 




CACGGAACAG 


CACACACACT 


CCCACAGACA 


GCACACACAC 


ACCCACGGAA 


CAGCACACTC 


68040 




TCCCACGCGG 


GGCCGCTGGG 


TTTCCTGCAG 


TTTCTCCTCC 


TCCAGGCCTT 


TCCCTGGACC 


68100 




CTGGTCCAGT 


CCGTCATTTG 


AGCACAGGTG 


CCTGTTAGAA 


CGAGACCTTC 


TTGTTAGGAC 


68160 


5 


GATGAGTGTC 


CCAGCCACCA 


CCTCTTTTGG 


ACTCCGGGAG 


GCCTGGAACG 


TTCTGAACGC 


68220 




TCCGTGGGGC 


TCCAGTCTTC 


TCCGCAGCCA 


GGGCAGCAGG 


GTTTGCTGTC 


TGTCCTGCAG 


68280 




GCAGATGAGG 


AGTCAGGGCT 


GGGGCCTGTG 


TGGGGGCTCT 


CCTGAGCGCG 


CAGCCGCCGA 


68340 




GGTGGAGCGT 


GTTCTGCCTG 


AGCGCCGACC 


TGGTCGGGGG 


AATCCCAGTT 


GCTTCCAGGT 


68400 




GGAGCCACTG 


TCCTCAGCGT 


AATGCTCAAG 


GCTCTGGCCT 


GGCTCCTCGG 


CCACCCTGCA 


68460 


10 


CCCTCAGGGT 


CCCCTCCTGT 


AGCTTCTGCT 


GCCCCATCAC 


TGTCACTCTC 


CAAAGCTTTG 


68520 




GGGACTCTGC 


CCAGAGCCAC 


CGCCTCCCAG 


AAGCCCCTGA 


CAACCTCTTG 


ACGACCCCCT 


68580 




AGTGACCCCA 


TCCCTCCCCT 


CTGACGGCGG 


CCCCTGCTCT 


GAGGCGGCTT 


CTTTTCCTCG 


68640 




GTGCTGTTCT 


CGTGCTGGCC 


AGGCCTCCTC 


TCCCCACCTG 


GAGGCTCCTG 


AGGGCGGAGG 


68700 




CCTCTCACCT 


CCAATGCTGG 


CGTCCCCTGG 


AGGGCTGAAT 


TTGTTTCCGA 


GGGAAGGAAA 


68760 


15 


CTTCCACAGT 


TGTTGCCTTC 


AGTTCCAAAG 


CTGCAGCCTG 


ATTTCCCCCT 


CCAGGCTCGA 


68820 




GCCTGTTTTC 


TTCTCGGCAG 


CTACATCTTT 


GACCAGTGTC 


GTCCCCCCTC 


AGGCCCGAGC 


68880 




CTGCCTTCTT 


CTCCTCAGTT 


CCCAAAGCTG 


CAGTCTGGTC 


CCCCCGCCAG 


GCTCGAGCCT 


68940 




GCCTTCTTCT 


CCTCGGCAGC 


TACATCTTTG 


AGCTGTTTGC 


TGAAGCCCAG 


ATCACGTTTC 


69000 




AGACCAAGGG 


CTGCATCCTG 


GACTCGCTGG 


ACCAGATCAT 


CCAGCACCTG 


GCAGGACGTG 


69060 


20 


AGTGCTGGCA 


CGGGGTCTTT 


GGTGCGGGCA 


AATGTGGCGT 


AGGGGGTGCA 


GCAGGCCTCC 


69120 




ATCTTGGCAG 


TCAGGGCTCC 


CCTGGCCGTC 


ACCTGGCCGT 


CAGCAGGAAC 


AGGCCCACAG 


69180 




AACCTCATCT 


TCTGATCGGG 


GCGTGGAGGC 


GTTAGTGCCA 


CTTGCCAGCT 


GCCGTAGAGC 


69240 




CTGTCCCAGT 


TCTGCAGCTG 


GCGGCTTCGT 


CCTACAGCCT 


CATCCCATTA 


TTCTGCTTTT 


69300 




GAGAAAGAGC 


AGCCCAAGGC 


CCTAGCTGGC 


TTGTGGGGCC 


TCTGGCTTCT 


CCACACCACC 


69360 


25 


CCGAGTTCTG 


CTTCTCAGAG 


TTGTGGGGTC 


CAGAGGCTTT 


GCCCAGAGGC 


GGTGTCCCCA 


69420 




TGGGCTGCTC 


TGGTTTGAGA 


CGCCGGGCCC 


AGCGGGGTCT 


CTCCTCTGCT 


GCGCTCCCGG 


69480 




GTGCTGGGGA 


GGGTGGCTTT 


TGCTGCTTCA 


ACCCTTAGGC 


GACCATAGAG 


CCTCTTTTCA 


69540 




AGTCCCACTG 


ACCCCCTTGG 


AGACTCTGTC 


CCTGCCTGGC 


TTCTCTCCTG 


GCTGCTGGGA 


69600 




AGAGCAGGCG 


AACTGCCCGC 


CCTGAATGGA 


TGCTGCGCTC 


CACCCTGGGC 


CCCCCATTGG 


69660 


30 


GCAGGAGATG 


GAGCTTGGCA 


GTCGGGCTGA 


GCGGGCTCAT 


GCTGGAAGGG 


CCGGGGCTGG 


69720 




GGTCGGGGCC 


TCCCCTGCCT 


GCAGTGTGGG 


TGTCAGCGCC 


CTGCTGCCCT 


CCAGGTGCTG 


69780 




GAGTGTTCAC 


CAACACGGCC 


GGACTGCAGA 


AGCTGGCGGA 


CATTATCCAG 


GTGGGGCCTG 


69840 




CTCCTCTGTG 


GCATCTCCTT 


CCCTGATGGA 


AGCCGGGCGG 


GTGCCTTCTC 


CTGCTGTATT 


69900 




AGTTAACTGA 


TTCTAGACTT 


GGGGATGGGA 


GAAAGGCCCC 


TACACCACCT 


GTTTCTGATT 


69960 
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GGCAAACTCT 


CGGCTCCTTT 


CCAGTGCCCT 


AAACCCACAC 


TGGGCCTCCT 


GCAGGGATGG 


70020 


GGGAGGACGA 


GGTCTGGTGG 


CACATGCCCA 


GGGTGATGCT 


GGTGAGGGAG 


GACGCAAAGG 


70080 


ACAGTGGGGG 


CCGGGGAGCC 


GCTCCTGCCC 


TGTCCGGGCC 


CTCAGGCCAG 


GGGGGACCCA 


70140 


CTGCTGGCAG 


CCCCAGCAGC 


CCCAGCTGCA 


CGCAGATGAA 


GAGCTCTGGA 


CACACGCGGC 


70200 


TTCCTGAACA 


GCTTCTCCAG 


GGACAGACAA 


ATGGGGACCC 


TGCAGGTTCC 


CGGCAGGGGT 


70260 


GTCCCTGGGA 


GCCCATGATT 


GGGGGTGCGA 


CCCTGGCCCC 


CTTCTCATTG 


GCCCCGTCCT 


70320 


GTCCTGCAAT 


GCCCGTCCCA 


TGTGAGGTCT 


GCTTCTGGCT 


CCATGCCTAT 


GGCAGCACCT 


70380 


GCTTTCCCTG 


GCGTAGAGGT 


GCTTGTCCGG 


TTTGTGGAGG 


GCACGCCCCA 


TTTTGGGTGC 


70440 


TCTGGGCACG 


TTGCCTCTCC 


GGGGCCTCGG 


TGGCTTTTTT 


AGAAGCAGAC 


TCAGAAGTCC 


70500 


CTGACTGGGG 


AAGCCAAGGC 


ACAGGTGGCT 


GTGTGGAGCC 


CTGTGAGGCC 


TCCTCTGTGC 


70560 


TGCCCACGCT 


GTACCTGCTG 


GCCACACGAG 


ATCATGGCAG 


GGTTAGGCAG 


GGCTGCCCAG 


70620 


CGCTATGACA 


GCTTCATGAG 


TGTCCATCTG 


GCCTGTGGGG 


TGCTTGAGCT 


GGGGGAGGCC 


70680 


GCAGAAGAAC 


CCTGGGATGC 


ATGGCTGGCC 


TGTGCATGCT 


GCTGGGCATG 


GAGCTGCAGA 


70740 


TCCCGGAACA 


AGCAGGCACT 


GCCTTCTCCT 


TCACAGACGC 


AGCTCTGAGC 


GGGGGCGAGA 


70800 


CCTGGGCAGG 


GACCAGGTGG 


GGTGGGCACA 


GGGTGGTGGG 


GCCCAGGCTC 


AGCCCTCCCT 


70860 


CCACTGTGGC 


CGTCTCTGTG 


GCCAGTGACG 


CCACAGCCTG 


TGTCTTCTCT 


GTGCGGTAGC 


70920 


TGGGGCTGGA 


AGGACAGCAC 


TGCCTTGTCC 


TCCCAACTCC 


TCCCCAAAGG 


CACGGTGGGC 


70980 


ATCCCAGGCC 


CAGACCCCTC 


TGTCTGTGGC 


TCCTGCCTGC 


CAAGGGCTGC 


TGTGCTGTCC 


71040 


CGCATGGAGT 


GTGGTTGGCT 


CTTCAAGCAG 


GAGGCCGTGC 


ACCTATCAGG 


CGGACCTGCT 


71100 


TCCATGTCCC 


TGATGGGTCA 


CTGCAAAGCA 


CCTCCAGCAC 


ATGGCCAGGC 


GAGGTAGCCC 


71160 


TGCAGCCCAG 


GGCCTGGAGG 


GCAGGTGTGA 


GCTGGCCCGG 


GCCTGTCCCT 


CCCTGGAATA 


71220 


CAGCTTCCCA 


GGCTCCCACT 


TATGGAGAAG 


TCTCCTCCAC 


ACTATGGAAC 


TGAATCCTAG 


71280 


AATGTGGCTT 


CTGAGGTTCC 


TACACTCGAA 


CTGAATCCTG 


GAATGCGGCT 


TCCAAGGCTT 


71340 


CCAGCTATGG 


AGAAGACTCC 


ACACTCTGGA 


ACCGAATCCT 


GGAACGCGGC 


CTCCCAGGCC 


71400 


CCCAGCTATG 


GAGAAGACTC 


CACACTCTGG 


AACCGAATCC 


TGGAACGCGG 


CCTCCCAGGC 


71460 


CCCCAGCTAT 


GGAGAAGACT 


CCACACTCTG 


GAACCGGATC 


CTGGAACGCG 


GCCTCCCAGC 


71520 


CTCCCACTTA 


AGGAGAAGTC 


TCCACACTCT 


GGAACCGGAT 


CCTGGAACGT 


GGCCTCCCAG 


71580 


GCCCCCACTT 


AAGGAGAAGA 


CTCCACACTC 


TGGAACCGAA 


TCCTGCACAC 


TCCATCGGTT 


71640 


TGGAATTTCC 


TTTGGCTGCT 


GCTCTAAGTA 


GCCGCTGGTG 


GATGACTCAG 


CTTCTGCCAG 


71700 


CCCTCGGGTG 


CCTGGAGGAT 


GAGGGACTGC 


ACACAGTGCT 


CACCCGCGTT 


GGCTCCTGAG 


71760 


CCCCTGCAGG 


TGTGGGCGGT 


GCCCATAGGG 


CTGGTGCTGG 


GTTGGGCCTG 


CAGCCCTGAG 


71820 


TCACAGGTGA 


CCCTGGGGGC 


AGAGTGGGGC 


CAGTGGCCCC 


AGGAAGAGGA 


TGTGGGATGC 


71880 


ACAGCTCAGC 


TGGAGGCGAA 


CTCCAGGCAG 


GGTCAGGCCG 


TGTGCTCGGA 


AGTCAGGGCT 


71940 


TAGCTGGAGG 


CAAACTCTGG 


GCAGTGCTGG 


CCCGTGTTGG 


GGAACCAGTT 


GCCCCTGGGC 


72000 



-56- 



WO 01/42434 



PCT/US00/33065 





CCCCGTGAGA 


CTGCTGGGTC 


CTCATCCCTC 


TCTGCCTGAG 


GCCGGAGCTG 


CCCTGGGCTG 


72060 




AGGCACAGGG 


GGATTTGTGG 


TGGTGTTTTT 


TTGAGAAAGG 


GTCTCGCTTT 


GTCACCCCGG 


72120 




CTGGAGTGCA 


GGGGCTTGAT 


CACAGCTCAC 


TGCAGCCTCA 


ACCTCCTGGG 


CCCAAGTGAT 


72180 




CCTCTTGCCT 


CAGCCACCCG 


AGGAGCTGTG 


AACACAGGTG 


TGCACCACCG 


CACTCAGCTA 


72240 


5 


ATTTTTAAAA 


TTTTTTTGTA 


GAGATGAGGT 


CTTGCCATGT 


TTCCCAGGCT 


GGTCTCAAAC 


72300 




TCCTGGGCTC 


AGGCAGTCTG 


CCCGCCTTGG 


CCTCCCAAAG 


TGCTGGGATT 


ACAGGCAAGA 


72360 




GCTTCCATGC 


CTGCCCAGCA 


GAAGGCTTTT 


CGAAGGAAGC 


TGTTTCCTGA 


GGCAGACTCA 


72420 




GCCCTGCTCA 


TGGCAGCCAC 


CAGCGTGGGG 


GTGAACTTGT 


TCTGTTACTT 


CCATCCCCGT 


72480 




GGGCCAAATG 


CTTTGGTAAA 


ACACAAGGCC 


CTGTGTTTAG 


CTGTCTTGAC 


AGTGAAAATG 


72540 


10 


GCTGGGAAGG 


AAGGAAGGAA 


CGGAAGGAAA 


TTTCTCTCTC 


CTTCTGTGCG 


TACCCAGGCA 


72600 




CGTGCACATG 


CATGCAGAGT 


ACGCACACAC 


GCACGCACGC 


CTGCACAAAT 


CCACGCATGT 


72660 




TGCCAAGTCT 


CTGTGTTCCA 


GCCGTGGTGT 


CTGCCCCCCG 


GTGTTCTCTA 


GTTCGGCTTC 


72720 




TCCGCATTTC 


TGTGAATGAT 


TCCGGCTTCT 


TGGTGTTCCC 


AGCAGAACTC 


CCTCAAGTCT 


72780 




GCGGCGGGGC 


TCTGACGGCG 


GTGGCTTGGC 


TGACATGGCC 


ACATTGCTGA 


GCCTGTTGGG 


72840 


15 


GGCTTTGCGT 


TCCTGTTCTG 


GCCGTTTTTG 


GCTCGTTTTC 


CAGGAACGGT 


CGTCACGCGC 


72900 




TCCTCTCCTA 


GTGCAGGCAT 


CATTCCTTTC 


CCATTGATTT 


GCAGGGTTCT 


CTGTAAGTTC 


72960 




TGAGGATCCC 


ATATACATAT 


ACTCTCTGTA 


AGTTCTGAGG 


ATCCCATATA 


CATATTCTCT 


73020 




CTCTAAGTTC 


TGAGGATCCC 


ATATACATAT 


TCTCTCTCTA 


AGTTCTGAGG 


ATCCCATGCC 


73080 




GACATACATA 


TTCTTTCCTT 


GTCTCATGCT 


GGTCATTTTT 


TCCATTTTCA 


TGACAGGTTT 


73140 


20 


GGTGAACACA 


TGTTTCCTTG 


TCAGATTTTT 


GTTCTGAGCT 


TGTGCCTCCC 


GACCAAGATG 


73200 




CTAAACCGGG 


TCTTGTGTAT 


TCTCCAAACT 


GCACTGTAGA 


GTGACGGAGC 


TTTGTGTCTG 


73260 




GGCCTCCATG 


CCTTCTGACG 


TCACCTGTGG 


GGGTGTGAAA 


GGCAGACTCT 


ACCTTGATTT 


73320 




TTCCCAGCAC 


GCCACACCGG 


TGGTTCTGTG 


CGCTGACCGA 


GCGGCTCGGC 


TTCCCCCAAC 


73380 




TCCACTGGGC 


ACCTGCCACA 


CTTTTCCTCA 


TGTTTTTGTT 


CACTGTGGTT 


TTGTCGTAAG 


73440 


25 


TCCTGGTGTT 


GGCCTGAACC 


AATTTCTTTT 


TGTTTGTTTT 


TGAGACAGAG 


TTTTGCTCTT 


73500 




GTTGCCCAGG 


CTGGAGTGCA 


GTGGCGCGAT 


CTCGGCTCAC 


TGCAAGCTCC 


GCCTCCCGGG 


73560 




TTCACGCCAT 


TCTCCTGCCT 


CAGCCTCCCA 


AATACCTGGG 


ATTATAGGCA 


CCTGCCACCA 


73620 




CGCCTGGCTA 


ATTTTTTGTA 


TTTTTAGTAG 


AGACGAGGTT 


TCACCGTGTT 


AGCCAGGATG 


73680 




GTCTCGATCT 


CCTGACCTCG 


TGATCCGCCT 


CCCAAAGTGC 


TGGGATTACA 


GGCATGAGCC 


73740 


30 


ACCGTGCCCA 


GCCTGATATT 


TTTAGTAGAA 


ATGGGGTTTT 


GCCATGTTGG 


CCAGGCTGGT 


73800 




CTCGAACTCC 


TGACCTCAGG 


TGATCCTCTC 


ACCTTGGCCT 


CCCAGAGTGC 


TGGGATTACG 


73860 




GGTGTGAGCC 


ACCACGCCCG 


GCCTCTTGTT 


CTTTTGAAAC 


CTGCCCTGAC 


GTTTTTTCCA 


73920 




TAGTGCATCT 


TGGAGTCAGC 


GTGTCTACTT 


CCTGTAAAAA 


TCTTACTGTG 


ATTTTGACTA 


73980 




GAATGTGTTG 


AATTCCTGTT 


TTTTTTTTGA 


GTCAGGGTCT 


CTCTGTTGCC 


CAGGCTGGAG 


74040 
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TGCAGTGGGA 


CCATCACAGC 


TCACTGCAGC 


CTCAACCTCC 


TGGGCTCAGG 


GGATCCTCTC 


74100 




AGCTCAACCT 


CCCAAGTAGC 


TGGGACCACA 


GGCACATGCC 


ACCATGCCCG 


GCTAGGTTTT 


74160 




TTT TTTTTTT 


TTTTTGGTGA 


ACACCCTGGG 


GTTGCACCAT 


GTTGCCCAGG 


CTGGTCTCGA 


74220 




ACTCCTGGGT 


TCGGGCAGTT 


TGCTCCTCTC 


AGCCTCCCGG 


AGTGCTGGGA 


TTACAGGCCT 


74280 


5 


GAGCCACTGC 


ACTAGGCCAT 


GTTGAATTTC 


TAGATTAATT 


TGGGGCCCTC 


AGGGGCACAG 


74340 




AGAGGAGGGC 


TGGGCCAGTT 


GGCGGGAGGA 


GAGGCCCCTC 


GGGCTGCCGC 


ATTTTCAGTG 


74400 




CATGGAGATG 


GCCTATGTTG 


GGGGAACACA 


GAGCTCACCG 


GGGGTCCCTG 


CAGGGAGGAG 


74460 




AAAGGGTCAG 


GCAGGTGCCA 


GCTCCTGTCC 


ATTGGCCTGG 


GGCTGCATGA 


TGGCAGGGGC 


74520 




CGGTGAACCG 


ATGACCCCTG 


GGTGTCCTGT 


GACCTTCTGT 


GTATGCGGCT 


GATGCTGCAG 


74580 


10 


AAAGTCGGGT 


GGCCTCAGGC 


TCCTGACGGG 


GCTGCACTTC 


CTCTGCCTTT 


CAGATTGTGT 


74640 




TCAGTGTGGA 


CCCCTCCGAG 


GGCAGCCCTG 


GTTCCCCAGC 


AGGGCTGGGG 


GCCTTACAGT 


74700 




CCTATAAGGT 


AGGGGCCACC 


TCCAGGAGGC 


AGGTGGAGGG 


CAGCCCTTGT 


TCCCCGGCAG 


74760 




GGCTGGGGGC 


CTTACAGTCC 


TATAAGGTGG 


GGGCCACCTC 


CAGGAGGCAG 


GTGGGGCTGG 


74820 




GGGTCTTCTG 


GTCCTAAAAG 


GTAAGGGGCT 


GCCCCCAGGA 


CATGGGCGGG 


GCCTCCACAC 


74880 


15 


TCCTGGTCCT 


GTCCCCTCCA 


GGTGCACATC 


CATCCTGATG 


CTGGTCACCG 


GAGGACGGCT 


74940 




CAGCGGTCTG 


ATGCCTGGAG 


CACCACTGCA 


GCCAGAAAGC 


GAGGTACAGA 


CCTGGGCCCA 


75000 




CACGCTCCCC 


GCCCGCCCGG 


GTGCAGTGCC 


CGGCACCACC 


ATGCCACAGG 


CTAGGCACAT 


75060 




GCCCAGCCGT 


GGATCTCCTG 


CCCCCATGGG 


CCTGGCCACC 


TTCTCCATAT 


CCAGGCCAAT 


75120 




CCAGAGCATT 


CTCCTCACTG 


TCCCTCTGAA 


GATTGGAGTT 


ACTGAGAGAC 


GTAGGAGATG 


75180 


20 


GCCTGATGGC 


ACCGTGACCT 


GCCCAGAGTC 


ACCTGGTTGG 


TGGTGGCAGA 


GCCACAGCCC 


75240 




AGCCAGGCCT 


CCCTGCTGGG 


ACACGCTCGT 


TTATGCCGAG 


GCCGTCAGCA 


CAGAGCCTCC 


75300 




ACAGTGAGGC 


ACGGCTCTGC 


CTGCTGCCTC 


CACGCAGCGC 


CTGGCCGGGC 


CAAGCCTCAG 


75360 




GGTCACATCT 


GAAGGGGGCC 


CGGCTGGCCC 


TGTTGTCCGA 


AGCCCCTGGT 


GCGCTCAGCC 


75420 




CCGAGGCCCC 


ACGTGCCTTC 


TTGGCTTCCT 


GTGCTCCGTG 


GCGTCTTCGA 


GTCGGTGCTG 


75480 


25 


CCGGGGACGC 


TGTGTGGATG 


GGGTCTGTGA 


GTGTGCCCTC 


GGCTCCGTGT 


CCGGAGCCCT 


75540 




GTGGTTCTTG 


GGGTGTATCT 


GGCCCCACCC 


CCACTGCGTG 


GTGTCCAGGG 


TGGGGCTTCA 


75600 




CGGCTGCAGC 


TGCGGGAGCT 


GCTGCCCCTG 


CCTTGTGCTC 


CAGTGGGGCC 


TTGCCTCTGG 


75660 




GCTTGGTTCG 


TCCCTCTCTG 


GAACATTCTT 


TCTCAGCTGC 


TGTCCGACCC 


ATGGTGGCAT 


75720 




GACGTGGCCC 


TGGCTGAAGC 


AGCCCTTGTG 


CGGTTGCTGT 


GGTTGGGTCT 


GCCTGGCCGA 


75780 


30 


GCCGGAAGGG 


AAGGGCTGGG 


AGGGCGTCAG 


GGTGGCGTGG 


CTTGACCCCC 


GCTCGGTGAT 


75840 




GGTCCTGCAG 


CAAGGCCTCT 


CCCAGCAGGA 


AGCGTCCATC 


CCGGGGGGAG 


GCCGGCGCCC 


75900 




CTCACGCAGT 


TGGGGTTGCG 


GGAGGCAGTG 


CGTGCCTGAG 


GCAGCCGGTG 


CACAGATTCC 


75960 




AAGGGCCTGG 


AATCTGTTTG 


TTCCATTGAC 


CTCTGATGTC 


ACTTGACTTC 


TCAGAAGCAG 


76020 




CCACTCCCTG 


CACTGGGCGT 


TTGTAGGAAA 


TGAGCTCCTG 


GAGGAGGGGG 


TGGGGAAGTT 


76080 
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CCCCCATTGC 


AGGGCACACT 


CAGCCCCAGG 


AAGGAAACGT 


GCCTCGTCCC 


TGCTGACTCC 


76140 




GAATCGCAGT 


CAGAGTCGTT 


CTGCTTGTGC 


CGTGTTGAAT 


TCCCGGCATC 


CGGCATCCAG 


76200 




ACTCAGCCTC 


CTCCCCAGGC 


CACGGCCGCC 


GTGGCCAGTC 


GGTCAAGCCC 


TTCTAGGAAC 


76260 




TTCCTTTGAG 


CTGGCGCCCT 


TGTTCACTGC 


TGACGCCACT 


CAGAGGCTTG 


TGCACGTGTC 


76320 


5 


CTGCTTCCAG 


GCAGAGCTGG 


GAACTCGCAC 


CCCGTCTTCT 


GCACGCGGCC 


GTGGAATGTC 


76380 




GGGATGCCGG 


CGCTTCCTTC 


CCGTGTGCTC 


TTGGCGGGGT 


GGGCTTCTTG 


CCCTGAGCCG 


76440 




CATGTCACAG 


TTTCTGCAGA 


AGTTTAGGGT 


TGGAGTGGGC 


TGACCTCTCT 


GCAGGTGTCC 


76500 




CCAGCCTCTG 


CCTGGGGTCT 


GCCTCCTACT 


CCCAGGACCC 


CCTGTCCCCC 


AGAGGGGCCC 


76560 




CAAGCTGGCA 


GGCTCACACT 


CAGGGCAGCC 


TCCTTTGTTC 


TGACTTCTGC 


ACAGTGGGCC 


76620 


10 


TGGGTGGCTG 


CCCGCGGCTC 


GCTTGCTTGA 


TGCCAGTGGG 


TGGAGAGGGT 


GATGGGCAGA 


76680 




GAGGCAGGTG 


GTCAGGCCCC 


CAGTCCCGTC 


CTCACACTCT 


GTGCCCTCTG 


CCGCCCCCCG 


76740 




CCCCACAGGG 


AAGGTGCTGA 


GCTACTGGTG 


CTTCAGTCCC 


GGCCACAGCA 


TGCACGAGCT 


76800 




GGTCCGCCAG 


GGCGTCCGCT 


CCCTCATCCT 


TACCAGCGGC 


ACGCTGGCCC 


CGGTGTCCTC 


76860 




CTTTGCTCTG 


GAGATGCAGA 


TGTACGGGCC 


ACCCCTGCCA 


GGGCCTGAGC 


ACCGGTGACA 


76920 


15 


CCTCTGACAT 


CAGCGGGGTG 


GAAGTGGTGG 


GGGTCCCCAT 


GAGCCGGGTG 


CTGGGGGTCT 


76980 




CGGGCCTCGA 


GGGCTAAAGG 


GGTGCTGGTG 


CACTTCCCCA 


CTGTCTGCTC 


CCTCTGGCCA 


77040 




CGCTCAGCCC 


TTTCCCAGTC 


TGCCTGGAGA 


ACCCACACAT 


CATCGACAAG 


CACCAGATCT 


77100 




GGGTGGGGGT 


CGTCCCCAGA 


GGCCCCGATG 


GAGCCCAGTT 


GAGCTCCGCG 


TTTGACAGAC 


77160 




GGTGAGGGCC 


TGTCCCTGGG 


CCCTGCTGGG 


GTGGGAGGTG 


GGGGAGCACT 


GAGGCCTGAG 


77220 


20 


GCCCTGAGCA 


GTGGCCTCTC 


CGGCTCTAGG 


TTTTCCGAGG 


AGTGCTTATC 


CTCCCTGGGG 


77280 




AAGGCTCTGG 


GTGAGTGCCC 


TGAATGCCCC 


AGCTGTGCGC 


ATCCTGGATC 


CTGGACCCCT 


77340 




GCTCCCAAGA 


GCTGGTAGGG 


ACCCCTGCAG 


ACATCCTGCC 


CCTGCCTTGA 


CCCCGGCCCC 


77400 




TGCACTTCCA 


GGCAACATCG 


CCCGCGTGGT 


GCCCTATGGG 


CTCCTGATCT 


TCTTCCCTTC 


77460 




CTATCCTGTC 


ATGGAGAAGA 


GCCTGGAGTT 


CTGGCGGGTG 


CGTCTCCCCT 


GTGTTCTGGG 


77520 


25 


CGGGGTGGGT 


GAGGGCAGGG 


CTGGAGCATG 


AAGCAGGCAG 


TGGTCACAGC 


TCCTGCTTGC 


77580 




CCTCATCGGA 


TCGGCGGCGT 


GACCAGGGCT 


GCCGTGTCCC 


TGCCTCTTCC 


TCCCACAGGC 


77640 




CCGCGACTTG 


GCCAGGAAGA 


TGGAGGCGCT 


GAAGCCGCTG 


TTTGTGGAGC 


CCAGGAGCAA 


77700 




AGGCAGCTTC 


TCCGAGGTCG 


GCACTTGGCC 


GGGGCTCTGG 


GCCTGCTGCC 


CCCTCGTGCC 


77760 




TCCCCTGCCT 


CTCACAGCTT 


CCCCAAGGCT 


GACCACTGGC 


CCTGACCATG 


GGCTCCGGCG 


77820 


30 


GCTCCCGCTG 


CCTCTTCAGG 


GCTCCTGCGT 


TTCCTTCCTG 


GCCCTGAGTG 


TTGCCTCTTA 


77880 




TCTTACAAAG 


CCCCCAGCAC 


CGGGTGGGTG 


TGGTAACAGT 


GGCCCTCCTG 


TCTGAGTAGC 


77940 




CCTAGTCGGC 


CACCCTGGCC 


CTGGGGTTCC 


CCGTGTTTTC 


TGGGAAGCAC 


TGAGCAGGCG 


78000 




TGGGGTCAGC 


CTGGGATCCG 


TGCCAGGAAG 


AAGCTTCCAG 


AACCCGATTG 


GCCTTCCTGG 


78060 




CTAGGACGAT 


CCTTCATCTT 


GGAGCATGAG 


ACCTGGGTCT 


CCCTCATGGG 


GGAGGAAGGG 


78120 
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GCTGGGGGGG GGCTCCAGGC 
TATGCAAGGG TTGCCGCCCC 
AAGGTGAGCT CTCCAGGGCC 
ATGGCTCCCC AGCAGACTCT 
5 AGAGCGACTG CTGGCCCTGC 
GAGTTTCCTG GCCACAAGAG 
CCTGACCCAC AGATGGAGCT 
GGCTGGGCTG GGTGTGGGCA 
TGTGTCCCCT AGGCCAGCGA 

10 GTCACGGGCC TCCCGTACCC 
CTGGATGAGA TGAAGGGCCA 
CTGGGGTAAG GCGGTCTGGT 
CGAGCCGCCT CGCCCCACAG 
GGGCTGTGAA CCAGGCCATC 

15 TCCTCTGTGA CCACAGGTGC 
ACACCCCACT GGGCCCCTGG 
TCTCCGGCTA CTCGGGGTCA 
GAACTGAACC CTTGAAGCGG 
CGCCTTTGCC GACGCAAGAG 

20 TGACAACTTT GGCCATGTCA 
TGTGAGTTCC TGCCCAGGGA 
CCACCCACCC CCATGGTTCT 
GTGCGTGGAG AAGATGCTGT 
AAAGCTAAGA GTCTGGACCT 

25 GGGCAGCGCT GGGTGGGCGG 
CAGGCGCCCA GGGCGGAGGC 
GGCTGCCTCT CCCTCCTAGG 
GAACGCCCCA GGCAAGGATG 
CCCACAGGCA GGACCCCAAG 

30 GGACCCCAGC TGGGGCCCCC 
CCAGCTGCCG GGGACCCCGA 
GCCCGGCAGA GGCCCAGGGG 
AGCCCTGGCG AGGAGCAGGT 
CCTGAGGCCA ACCCGACCCC 



TCAGCCTCAC CAACTTTCCT 
TGGGTCCACC GGCGCCACCT 
CTCTGCCCTG ACCTGGTTGC 
GGGCCCTGGG GGCTGCCCGG 
TGGGAGCGTG TCCTGCTCTG 
TTGGAGGTGG CGTCTGGGAG 
TCCTCCCACC CCTGGTTGGG 
CCAGGGAGAG GAGCCCCCAC 
GGGGCTGGAC TTCTCAGACA 
CCCACGCATG GACCCCCGGG 
GGGTGGGGCT GGGGGCCAGG 
GACTGAGCCC CCGCCCCGTG 
TTCCTCTCTG GGCAGGAGTG 
GGGCGAGTGA TCCGGCACCG 
GTGCAGTCCG GTGGCAGGCG 
ACTCTCCTTC CCCACATGAG 
GCGTGGGGCC CCTGCAGCAG 
CTGTGGGCAG GGCAGCAGGG 
CCCAACTGCC CTCCTGGGTG 
TCCGAGACGT GGCCCAGTTC 
GGGGATGAGG GTGTTGTCCC 
GCAGATGCCA GCGCCGGCCC 
CAGCGAGGCC AAGTCGCCTG 
GCATGTCCCC AGCCTGAAGC 
TGTGGGGGTG GCGGAGCGGG 
GACTCACCTG GCTTTGTGCG 
GCAGGGCCCC CACGGGCTGC 
CCCCCCGAGG CTGAGACTCC 
TGCTGGGACT CTCCCCCAAG 
CGTGGGCTTC ACTGCGCACT 
GAGTAGCCTG TGTGTGGAGT 
GCTGCTGGCC GCCCTGGAGC 
ACAGTTCCAG GGCCTTGGGA 
GCCCATCTGG CCTCAGGCAC 



TCCAGACCAT 


CAGTGCTTAC 


78180 


TCCTGGCGGT 


CTGCCGGGGC 


78240 


CTGTTCCCTG 


GTGGGTGCTT 


78300 


TCCCCTCCTT 


GGGTCCCACG 


78360 


GGCCTGGGCA 


GGCAGGATGG 


78420 


CTGTGGACCC 


CAAGTGGGGT 


78480 


GACGGAGCCT 


CGGGGAAGGT 


78540 


GGCCCCAGGC 


AGCTCCCTGG 


78600 


CGAATGGCCG 


TGGTGTGATT 


78660 


TTGTCCTCAA 


GATGCAGTTC 


78720 


TGAGTTACAG 


CAGGGTGGGG 


78780 


GCCAAGGGAG 


CCCCCGTGAC 


78840 


GTACCGGCAG 


CAGGCGTCCA 


78900 


CCAGGACTAC 


GGAGCTGTCT 


78960 


CGGCGCCAGG 


GGACACGCCC 


79020 


GCCCCGTCTC 


CTCCAGAGCC 


79080 


ATGAGGGTCT 


TCACTTCGGT 


79140 


CTATGGCCAC 


CCCCCAGGTT 


79200 


CGTCCCCACG 


TCAGGGTGTA 


79260 


TTCCGTGTTG 


CCGAGCGAAC 


79320 


CAGAGGAGCC 


AGAAATGGGT 


79380 


CCCGGGCTAC 


AGCACCCAGT 


79440 


GCCCCTTCTT 


CTCCACCAGG 


79500 


AGAGGTCCTC 


AGGTGCGGAC 


79560 


CGGCGTGGGG 


CGGGCAGCAC 


79620 


CTTCCCCTCC 


CACCTCCAAA 


79680 


AACCCTCCCC 


TACAGGCAGA 


79740 


CCCCAATAGC 


AGGGAGGACA 


79800 


AGGGGCTTTG 


CCACAGGCAG 


79860 


CGGGTGCCCC 


TGCAGGGTCA 


79920 


ATGAGCAGGA 


GCCAGTTCCT 


79980 


ACAGCGAACA 


GCGGGCGGGG 


80040 


TGGACACAGA 


CCCTCTGTCT 


80100 


CTCCCCACAC 


ACCCCTGTAA 


80160 
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ATCCCCTGCC 


TGGCAGGCAG 


GCGGGCAAGC 


GGGCGGGGGA 


TCCCAGCTGC 


CTGGCTGTCT 


80220 




GTGGGTCCTC 


CACCCCACCT 


CACCCACAGG 


CTGCTGGCTC 


CCAGGTGGTG 


CATGCCCTGG 


80280 




CCCTCCGCGG 


GTGCCCCCCA 


CATCACTTTG 


GTTCTCTGGC 


GGGTCAGCTT 


GGCTCAGTGC 


80340 




ACTCAAGGTC 


GGGTGCCCCT 


GCCACTGGCT 


GCGCTTGAGG 


CTGGCCTTTC 


TCCAGGAATG 


80400 


5 


TGCTGCGGGT 


GGAACCCAGG 


TTCCTTCTTC 


CTTGGGGCCT 


TTTGCCCCAG 


AAGCCCATAA 


80460 




TTCCTCAGGC 


CAACCCGAAA 


TTTTCTCCCT 


GCTTCCTGCT 


GGGAGCCATT 


CCCCTCTTCC 


80520 




TGCCCATCCC 


TGCCCTTCAG 


GCCCCTGGAG 


TGAGCTCCAG 


GTGCAGGCAC 


CAGGCACCTG 


80580 




TGTCCCCTTC 


CTGCCAGCCC 


CTCGCTGTGG 


TCGGACTGTC 


TTCCCTGGAC 


CTGCTCTTAC 


80640 




AAGTCACCAC 


CTGCGAGCCT 


CATGAGCCGC 


TGGTGTGACT 


TGGACAGGAC 


CAAGTTGTGG 


80700 


10 


CACTGTCACC 


GGGGTGTGCT 


GTGCCCCCCT 


CCCCCGACCT 


CCATCTTGGC 


TCAGGGCTCC 


80760 




TTGGGACCAT 


CTTCCCTGTG 


CGTCCAGGTG 


CTTTGGGACC 


CCAGAGTGTG 


TGGTTGGGGT 


80820 




CTGTGTGTGG 


TTGTGAGCTG 


TGTCCTCCTC 


AGGCCCACAG 


CTGCTCCACC 


CTGTCCCTCC 


80880 




TGTCTGAGAA 


GAGGCCGGCA 


GAAGAACCGC 


GAGGAGGGAG 


GAAGAAGATC 


CGGCTGGTCA 


80940 




GCCACCCGGT 


GCGTGAGCTG 


TCCCTGCACC 


TGTGCCGACC 


ACCATAGACA 


CGCATGGGAA 


81000 


15 


CGCAGCCGTG 


GGTGCCCCCA 


GCCACGGCTG 


GTCCCGATGG 


GACCAGGGAA 


TCCACCCCCA 


81060 




GGAGCTGATG 


TCCAGGGCAG 


CTGTGATGCT 


GACGGCCAGG 


GGCTCAAGTG 


TGTGGTTTCT 


81120 




TCTGCAGGGG 


GCTCATGAGT 


CCCAGCTGGA 


ATCAGGCCCC 


ACCCTTGGGC 


AGGTTTGGCA 


81180 




TGGGGCCTGC 


AGCACTGGGC 


TTGGCCCTGG 


CATTTCCCTC 


AAGTGTGGAT 


GCACACCTGC 


81240 




CTCATGTGAG 


GGACACAGCC 


CATTCCTAGC 


CTTGGATCAA 


AGAACGGAGT 


TATAGCCGGA 


81300 


20 


GCCAGGAAGC 


CCCCTGCCTG 


CTGGAAAACC 


CCAAGTGTGG 


CGGCCTTTGT 


CCATGTCCCT 


81360 




TGGCTTCTGG 


GAAGAACTGG 


GTGGTGCCCA 


GGCAGGGCTG 


GTGCCATCAG 


GAAGTGGGTG 


81420 




GCTGCTGAGG 


GGCCTGGGCT 


GGCGAGGGCC 


TGGGTGGGGA 


GTGCCTGGGC 


CGCCCCTGCC 


81480 




TTGGTTTCCA 


CGTTTCCGTG 


TTGGTCTGGG 


GTGTGTAGAG 


AGATGGGCAC 


TGCTCATCCG 


81540 




GAAGCCCCTC 


CTTGTGCGCT 


GCCATCCTGG 


GAGCCTCAGC 


CGCATCCGCT 


GTGGGGCAGG 


81600 


25 


GGGCTTGAGG 


GAGGAGGAGA 


GAGACGGGCC 


ATGCAGGACC 


CCTGGCTTGA 


GGCAGAGCCA 


81660 




ATCTACCCTT 


TGCCCATTCA 


CTGCTCTCAG 


TTCCCTGCCA 


GCCTCTCACT 


GTGTGACCTC 


81720 




AGACGGGCCC 


AGCCCCACAG 


CTTTCTTCCC 


GCAGCCCCTC 


CCTATGTCCA 


TCCAGCCAGC 


81780 




CAGTTTCTCA 


GGCAGCAGCC 


CCACCTCGGC 


AGTCACTGTC 


CCAGGGAACG 


CTCAATGTTC 


81840 




CAAGGAAGGC 


TCTGCAGCCC 


CAGGGACCAG 


ATGATGAGGC 


TGGCCCTGAT 


GGAGCCTCGG 


81900 


30 


GCCTGTGTCC 


TGCAGGAGGA 


GCCCGTGGCT 


GGTGCACAGA 


CGGACAGGGC 


CAAGCTCTTC 


81960 




ATGGTGGCCG 


TGAAGCAGGA 


GTTGAGCCAA 


GCCAACTTTG 


CCACCTTCAC 


CCAGGCCCTG 


82020 




CAGGACTACA 


AGGGTTCCGA 


TGACTTCGCC 


GCCCTGGCCG 


CCTGTCTCGG 


CCCCCTCTTT 


82080 




GCTGAGGACC 


CCAAGAAGCA 


CAACCTGCTC 


CAAGGTGCCC 


TGGCTTGCAG 


AGGCCACCCA 


82140 




CCCTGAGGGC 


AGTGCTGCCG 


CCGCGTGTGG 


GGTGGGGGCC 


ATCTGGGTCC 


AAGGTGGTCT 


82200 
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CTGTTCTCTA GAGAAAAAGG 
CCAGTTTGTG CGGCCCCACC 
ACGAGGCTGT GGCTATCGGC 
CCTGGACCCC ACTGGTAAAT 
5 AGTGTGGGCC AGAGTCCTGG 
CCTCTCTGTT CCCCTATGGG 
ACCACCTCCA CCTTCACCAC 
ACCACCTCCA CCACCTCCAC 
ACCACCTCCA CCACCACCAC 

10 TCCACCTCCA CCACCACCAC 
ACCACCTCCA CCACCACCAC 
GGAGACCCTG TGCAACTCCA 
CAGCCCTGTC CAACTGCCAC 
GCACACGCCA GGGTCCTAGG 

15 GGTGGAGTCC AAGTCTCCAG 
TACAGAGTTC CCCTAGTTCA 
CTTCTGGGCT TGGTACTCAC 
CTCCTGAGTG GGGCTTTGGC 
TCTCCTCTGA TGCCCCCAGC 

20 GAGGTGGGTG AGGCCTGGCC 
AGATGCTCCC GACCAGCTTT 
CCCAAGCTGA CCGTGTCCAC 
CAGGGCAGGC CCCACCTGTC 
GTGCAGCCTA CGACTTGGTG 

25 CCGGGGCTGC CCCTGTGGGG 
GCAGTTGTCC TGAGCAGCTC 
CTCTCAGTCC TCCACCCCAG 
GGCAGCCAAC CACAGTGGGG 
GTGAGCGCCT ACCTGGCTGA 

30 TTGGCAGCGC TGACAGCCTA 
GCCGCCCTGA CCACTGCAAA 
CGTGGGGAAC AGCCGGTGGG 
ATTGAAGCTC CCCGCAGGGT 
ACAGACGTGC ACAGACCTGA 



GGCAGATGGG GACAGACGCC 
ATAAGCAGCA GTTTGAGGAG 
CTGAGCACAG CATTCCCCGA 
GGGGCCCCAG GTGGGACCCT 
GCTGCTTGGG GTGGGCATCC 
AGTGATGGGG GCCTCCACCT 
CACCACCTCC ACCACCACCA 
CACCTCCACC ACCACCACCA 
CACCACCACC ACCTCCACCT 
CACCTCCACC TCCACCACCT 
CACCACCTCC ACCTCCACCA 
TGCACAGCCC TGTCCCTGCC 
ACGTCCCCTG CCTCCCATGC 
GTCCTAGACC CCTGTCCTCC 
AGGCGGAAGC ATCTGTGTTC 
CCCAGGGGGG AACCTAGCCT 
TGGGATATCC TCATGCCTGC 
CTGCCCGCCA CTGTTCCAGC 
ACCCAGGCGT GTACCTGCCT 
TCCCTAGCCA GCCCTGCCCC 
GTGGCTCTAC ATCTCTTCAT 
GGCTGCAGCC CAGCAGCTGG 
GCCCAGGCCA CCCCCAACAG 
GGTCCCTCAG TGGCTTCACG 
AGCCATCTCA TGGTGGGGAC 
TCCAGGAGTT CCTGGAGGAA 
CGCCACTCTG AGCCATGCTA 
GTCTGGAGTG CCCAGAGCAG 
TGCCCGCAGG GCCCTGGGGT 
TAAGCAAGAC GACGACCTCG 
GCCAGAGGAC TTCCCCCTGC 
GTGGGGGGCA GGGGACAAAA 
TCAGCATGTT TGTGCGTCCA 
CCGGCCGGCC CTACCCGGGC 



CCTTCCTCTA 


CAGGCTTCTA 


82260 


GTCTGTATCC 


AGCTGACAGG 


82320 


AGGCAGCGGG 


CACAGCCGGT 


82380 


CAGACTCCTG 


CGTGGAAGGC 


82440 


TCGGGCCCTG 


CTTGGCCCCG 


82500 


CCACCACCAG 


CACCAGCAGC 


82560 


CCTCCACCAC 


CTCCACCTCC 


82620 


CCTCCACCAC 


CACCACCACC 


82680 


CCACCACCTC 


CACCACCACC 


82740 


CCACCTCCAC 


CTCCACCACC 


82800 


GCAGCAGCAT 


CACTTGTTGG 


82860 


ATAGCCCCGA 


CCCCTAAGCA 


82920 


ATGGTCCTGG 


GGGGTCAACT 


82980 


CTGTTTCTGC 


CTCTGTTTGG 


83040 


GTGTGTTAAT 


GAACAGCCCC 


83100 


GTTGGGACGA 


CCCCAGATCC 


83160 


ACCCAGCCTA 


CGGCTCTGAG 


83220 


CCCCATCCAG 


CAGGCTGGTG 


83280 


GGGTTTTCCC 


GCCCTGGTCT 


83340 


CCCACCCCAG 


GGAACTTTCC 


83400 


CAGGAAGAAC 


GGCGCCGGAT 


83460 


ACCCCCAAGA 


GCACCTGAAC 


83520 


GTAGCTGACT 


CCTGAACCGT 


83580 


AGGCTAACTC 


TTGAGTGTGG 


83640 


TGCTCCCGGT 


TCTGCACCCC 


83700 


GGGCGGGCAG 


GGCGGTGGGA 


83760 


CTCCCACACC 


AGGAGACCCT 


83820 


GGAAGCAGGG 


CCAGCACGCC 


83880 


CCGCGGGCTG 


TAGCCAACTC 


83940 


ACAAGGTGCT 


GGCTGTGTTG 


84000 


TGCACAGCAA 


GTGGCCCTGG 


84060 


TGGGGGCTGT 


GCCGGGTCTG 


84120 


CACCACAAGC 


AGCGCTTCTC 


84180 


ATGGAGCCAC 


CGGGACCCCA 


84240 
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GGAGGAGAGG 


CTTGCCGTGC 


CTCCTGTGCT 


TACCCACAGG 


GCTCCCCAAC 


CAGGTAGGGC 


84300 


ACCTGCCTGG 


CTGCTCCTGG 


CAGCGCCCCA 


ACCGCACGCA 


GCCCTGGGAG 


TGAGCAGCAA 


84360 


AGCCCCAGGC 


CCCCCTCAGA 


CTCAAGTCTC 


TGTCTCCAGG 


CCCCTCACGG 


TCCGAGAAGA 


84420 


CCGGGAAGAC 


CCAGAGCAAG 


ATCTCGTCCT 


TCCTTAGACA 


GAGGCCAGCA 


GGGACTGTGG 


84480 


GGGCGGGCGG 


TGAGGATGCA 


GGTCCCAGCC 


AGTCCTCAGG 


ACCTCCCCAC 


GGGCCTGCAG 


84540 


CATCTGAGTG 


GGGTGAGCCT 


CATGGGAGAG 


ACATCGCTGG 


GCAGCAGGCC 


ACGGGAGCTC 


84600 


CGGGCGGGCC 


CCTCTCAGCA 


GGCTGTGTGT 


GCCAGGGCTG 


TGGGGCAGAG 


GACGTGGTGC 


84660 


CCTTCCAGTG 


CCCTGCCTGT 


GACTTCCAGC 


GCTGCCAAGC 


CTGCTGGCAA 


CGGCACCTTC 


84720 


AGGTTGGTGC 


CTGGCCACTA 


CAGTTCCTGC 


TGGGTGTAGC 


CCCAGGTGAT 


GGGCTGAGGG 


84780 


GGAAAGGGCA 


GGCCCTTGTC 


CTGGTGGCAA 


CGCCTGGCAG 


ACGTGTGCAG 


TGGGCCGGTT 


84840 


GTCTCACAGG 


CCTCTAGGAT 


GTGCCCAGCC 


TGCCACACCG 


CCTCCAGGAA 


GCAGAGCGTC 


84900 


ATGCAGGTCT 


TCTGGCCAGA 


GCCCCAGTGA 


GTGCCCACGG 


AGGCCCCCAG 


CACACCCAAC 


84960 


GTGGCTTGAT 


CACCTGCCTG 


TCCAGCTCTG 


GTGGGCCAAG 


AACCCACCCA 


ACAGAATAGG 


85020 


CCAGCCCATG 


CCAGCCGGCT 


TGGCCCGCTG 


CAGGCCTCAG 


GCAGGCGGGG 


CCCATGGTTG 


85080 


GTCCCTGCGG 


TGGGACCGGA 


TCTGGGCCTG 


CCTCTGAGAA 


GCCCTGAGCT 


ACCTTGGGGT 


85140 


CTGGGGTGGG 


TTTCTGGGAA 


AGTGCTTCCC 


CAGAACTTCC 


CTGGCTCCTG 


GCCTGTGAGT 


85200 


GGTGCCACAG 


GGGCACCCCA 


GCTGAGCCCC 


TCACCGGGAA 


GGAGGAGACC 


CCCGTGGGCA 


85260 


CGTGTCCACT 


TTTAATCAGG 


GGACAGGGCT 


CTCTAATAAA 


GCTGCTGGCA 


GTGCCCAGGA 


85320 


CGGTGTCTTC 


GTGGCCTGGG 


CTTGGTGGTG 


GGAGTTGAGG 


GACAGGGAGT 


TGGCAGAGGC 


85380 


CCCTCCCAGC 


CTGCCATGTG 


ACACTGTACT 


TCCTCCACGG 


TGGGCTCAGC 


CCTGCCCTCA 


85440 


TCCTCACAGC 


CGCAGCCAAG 


CTGCAGTTGG 


TAGGGGATCC 


ACCGACACAC 


CAGGCTGCCT 


85500 


GGGCTGGTCT 


CTGGGTTGGG 


AGCTGCCCCA 


GGTGCTGAGG 


AGGGCAGCTC 


CCTGGCTGGT 


85560 


GAGGCCCCTC 


CCAGAACCAC 


CCTTGGACTG 


AGCTCTGGGG 


AGGGATGGTA 


CCAGGTGGGT 


85620 


GAGGGGGGCT 


GCCTGGGGAG 


GGAGGGGTTC 


CTATGGGGCG 


TGGCGAGGCT 


GGCCCAGCCC 


85680 


TCTCCCCGCC 


CATATATGTA 


GGGCAGCAGC 


AGGATGGGCT 


TCTGGACTTG 


GGCGGCCCCT 


85740 


CCGCAGGCGG 


ACCGGGGGCA 


AAGGAGGTGG 


CATGTCGGTC 


AGGCACAGCA 


GGGTCCTGTG 


85800 


TCCGCGCTGA 


GCCGCGCTCT 


CCCTGCTCCA 


GCAAGGACCA 


TGAGGGCGCT 


GGAGGGGCCA 


85860 


GGCCTGTCGC 


TGCTGTGCCT 


GGTGTTGGCG 


CTGCCTGCCC 


TGCTGCCGGT 


GCCGGCTGTA 


85920 


CGCGGAGTGG 


CAGAAACACC 


CACCTACCCC 


TGGCGGGACG 


CAGAGACAGG 


GGAGCGGCTG 


85980 


GTGTGTGCCC 


AGTGCCCCCC 


AGGCACCTTT 


GTGCAGCGGC 


CGTGCCGCCG 


AGACAGCCCC 


86040 


ACGACGTGTG 


GCCCGTGTCC 


ACCGCGCCAC 


TACACGCAGT 


TCTGGAACTA 


CCTGGAGCGC 


86100 


TGCCGCTACT 


GCAACGTCCT 


CTGCGGGGAG 


CGTGAGGAGG 


AGGCACGGGC 


TTGCCACGCC 


86160 


ACCCACAACC 


GCGCCTGCCG 


CTGCCGCACC 


GGCTTCTTCG 


CGCACGCTGG 


TTTCTGCTTG 


86220 


GAGCACGCAT 


CGTGTCCACC 


TGGTGCCGGC 


GTGATTGCCC 


CGGGTGAGAG 


CTGGGCGAGG 


86280 
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GGAGGGGCCC 


CCAGGAGTGG 


TGGCCGGAGG 


TGTGGCAGGG 


GTCAGGTTGC 


TGGTCCCAGC 


86340 




CTTGCACCCT 


GAGCTAGGAC 


ACCAGTTCCC 


CTGACCCTGT 


TCTTCCCTCC 


TGGCTGCAGG 


86400 




CACCCCCAGC 


CAGAACACGC 


AGTGCCAGCC 


GTGCCCCCCA 


GGCACCTTCT 


CAGCCAGCAG 


86460 




TTCCAGCTCA 


GAGCAGTGCC 


AGCCCCACCG 


CAACTGCACG 


GCCCTGGGCC 


TGGCCCTCAA 


86520 


5 


TGTGCCAGGC 


TCTTCCTCCC 


ATGACACGCT 


GTGCACCAGC 


TGCACTGGCT 


TCCCCCTCAG 


86580 




CACCAGGGTA 


CCAGGTGAGC 


CAGAGGCCTG 


AGGGGGCAGC 


ACACTGCAGG 


CCAGGCCCAC 


86640 




TTGTGCCCTC 


ACTCCTGCCC 


CTGCACGTGC 


ATCTAGCCTG 


AGGCATGCCA 


GCTGGCTCTG 


86700 




GGAAGGGGCC 


ACAGTGGATT 


TGAGGGGTCA 


GGGGTCCCTC 


CACTAGATCC 


CCACCAAGTC 


86760 




TGCCCTCTCA 


GGGGTGGCTG 


AGAATTTGGA 


TCTGAGCCAG 


GGCACAGCCT 


CCCCTGGGGA 


86820 


10 


GCTCTGGGAA 


AGTGGGCAGC 


AATCTCCTAA 


CTGCCCGAGG 


GGAAGGTGGC 


TGGCTCCTCT 


86880 




GACACGGAGA 


AACCGAGGCC 


TGATGGTAAC 


TCTCCTAACT 


GCCTGAGAGG 


AAGGTGGCTG 


86940 




CCTCCTCTGA 


CATGGGGAAA 


CCGAGGCCCA 


ATGTTAACCA 


CTGTTGAGAA 


GTCACAGGGG 


87000 




GAAGTGACCC 


CCTTAACATC 


AAGTCAGGTC 


CGGTCCATCT 


GCAGGTCCCA 


ACTCGCCCCT 


87060 




TCCGATGGCC 


CAGGAGCCCC 


AAGCCCTTGC 


CTGGGCCCCC 


TTGCCTCTTG 


CAGCCAAGGT 


87120 


15 


CCGAGTGGCC 


ACTCCTGCCC 


CCTAGGCCTT 


TGCTCCAGCT 


CTCTGACCGA 


AGGCTCCTGC 


87180 




CCCTTCTCCA 


GTCCCCATCG 


TTGCACTGCC 


CTCTCCAGCA 


CGGCTCACTG 


CACAGGGATT 


87240 




TCTCTCTCCT 


GCAAACCCCC 


CGAGTGGGGC 


CCAGAAAGCA 


GGGTACCTGG 


CAGCCCCCGC 


87300 




CAGTGTGTGT 


GGGTGAAATG 


ATCGGACCGC 


TGCCTCCCCA 


CCCCACTGCA 


GGAGCTGAGG 


87360 




AGTGTGAGCG 


TGCCGTCATC 


GACTTTGTGG 


CTTTCCAGGA 


CATCTCCATC 


AAGAGGCTGC 


87420 


20 


AGCGGCTGCT 


GCAGGCCCTC 


GAGGCCCCGG 


AGGGCTGGGG 


TCCGACACCA 


AGGGCGGGCC 


87480 




GCGCGGCCTT 


GCAGCTGAAG 


CTGCGTCGGC 


GGCTCACGGA 


GCTCCTGGGG 


GCGCAGGACG 


87540 




GGGCGCTGCT 


GGTGCGGCTG 


CTGCAGGCGC 


TGCGCGTGGC 


CAGGATGCCC 


GGGCTGGAGC 


87600 




GGAGCGTCCG 


TGAGCGCTTC 


CTCCCTGTGC 


ACTGATCCTG 


GCCCCCTCTT 


ATTTATTCTA 


87660 




CATCCTTGGC 


ACCCCACTTG 


CACTGAAAGA 


GGCTTTTTTT 


TAAATAGAAG 


AAATGAGGTT 


87720 


25 


TCTTAAAGCT 


TATTTTTATA 


AAGCTTTTTC 


ATAAAACTGG 


TTGTAGTTGC 


ACAGCTACTG 


87780 




GGAGGGCAGC 


CGGGGACACC 


TGAGCCGCCC 


GCTGTGCCCA 


GATCCCTCAG 


GCTGCCTGCC 


87840 




ATCAGAACTG 


CTGCCCGGGG 


CTTCCCCTAC 


CTCAGACAGA 


CCCTCCCTGG 


GAGGATCAGT 


87900 




GGGGAGTGCC 


ACCTCTGCCC 


CCAGTGGCTG 


TGGCACGTGG 


CAGGGGCCCC 


TGAAGCTCAG 


87960 




CGAGGGTCAG 


GGCCTGGGAG 


GGTATCATTG 


CTGGAAGAAC 


AGGATGGGGC 


TCAGGCCAGC 


88020 


30 


CCTAGTCGCC 


GGGGCCCACA 


CTAACCCCCC 


ACTTATGAAT 


TCCTCCCACT 


CCCAACTCAC 


88080 




AGGGGATTTC 


CCGAGAGGGG 


ACCTGCCAAA 


GACCTCCTCC 


AGGCCTCCCA 


TGCTTCCCGG 


88140 




GAAGTGAAGC 


TTCTCCCCCT 


CTGGGGCAGG 


CTCTGAAGCC 


TCCCGATGCA 


CCCAGAGCAA 


88200 




CCAGGGGGCT 


GCACCAGCCA 


CTCGCCTCCC 


CAGCACGGCC 


AGGTTCCCGG 


GGCTGGAGGT 


88260 




CCCCCCCAGG 


TCCTGGGAAC 


CAACCTGCAG 


AACACACACA 


GGGTCCCCTG 


GAGAGGACGC 


88320 



-64- 



WO 01/42434 



PCT/US00/33065 





GGGGACTTCC 


AGGGCCCGAC 


TCCTGTGAGT 


CACAGCCCCG 


CAGCTGCTGC 


GCCACCCCCA 


88380 




CCCTGACTCA 


TGCCCCTTCC 


CAGCAGCTCC 


TCCCAGGACC 


CCATGTCCTT 


CCCACATCCG 


88440 




CAGGAAGGGA 


GTGCCTGGAC 


TCTCCAGGCC 


CACCTGGGGA 


GCCCCTCACC 


TGCCCACCAG 


88500 




CCCCTGAGCA 


GCCCAGTAAC 


ACCATCACCG 


TGTCCAACAG 


CCAGGAGCCT 


CCACCCTCCA 


88560 


5 


GGAGGGAAGG 


GATGGACAGA 


GCCACACTCG 


CCGTCTTTAT 


TTTGCACTCA 


CCCTGGGTGA 


88620 




CACTGGGCAG 


GCCGCTCCTG 


CCCACAGCCA 


GACTGAGGAA 


GAACACAGCA 


CTCGGCAGGC 


88680 




CCAGTGGGGT 


CCGTGCAGGG 


AGGACCCCAG 


GACCAGCCTT 


ACTCCCGAGC 


AGGGGACACA 


88740 




GGGCCCCACA 


GAGAACCCCT 


CCGGGAGGTT 


CTCTCCTGGC 


TGGGGGAGGG 


CTCTGGACCC 


88800 




CCACAAACAC 


TCCCCAACTT 


GCGGGGCTGG 


GGCATAAAAA 


CAGCCACTCC 


CAGCAGGCCC 


88860 


10 


CCTCAGCTTT 


TTGCATCAGT 


CAGCTCCCTC 


CCGGGGGATT 


AGGGTGAGGT 


GAAGCCAGGC 


88920 




CCAGGCGTGG 


GGTATAGGTC 


TTCCCCCGCA 


GGCCTCAGCC 


CTGTCCCGAG 


GCTGCATCAC 


88980 




AATCCAGGGC 


CCCCGCTGGC 


CTTTGGGAAC 


ATGGCCTGGG 


TCTTCCTCAA 


GGCAAGATCA 


89040 




GCCCCAGACC 


ACTTCCGGGG 


TCACGGGGTC 


ACAGGGCAGA 


AGCCAGATGG 


CAGCCATGGC 


89100 




TGACGGGCCT 


CCTCCTCGAT 


GGGGCGGAGA 


CAGCCACGGG 


GTCTCCCGAG 


GGTCCCACAG 


89160 


15 


GGCTGTCCTC 


ATGCAGCCCA 


AGCCAGCCTG 


AGCACTGGAG 


CCCCAATTCC 


CAACCAGGTC 


89220 




TCCCTCAGAC 


CCCCCAGAAA 


GGGCCTCGAA 


AGGCCGCCGC 


TGCGCCCTGT 


GGAAAGGCTG 


89280 




CCGCTGCAGG 


GCCTGGGCCA 


GCCGGGCTGC 


CAGACTCCCC 


TCCAAAGCCT 


CCGGATGCCT 


89340 




ACGCTTTTCC 


AGACATAGAG 


GAAAGTTTGT 


CTTCGAGAAA 


ACAAAGTAAA 


TAGAAGAACC 


89400 




CCAAAGCAAA 


GCAAACCCAC 


CCCCCAGATC 


AGCAGCATGG 


GAGCCAACAG 


GAGGCCACTC 


89460 


20 


CTCCAGCACC 


AGGGGACCAG 


CCGTCCCGAC 


GGCAGCGCGG 


CTGCGCCTAC 


GTGATGTCCC 


89520 




TCTGCCGCGG 


CGGCCGGTGC 


ACATTCCGCA 


CGACACACTT 


CACCATCCAC 


TCGATGCCCT 


89580 




CGCGCACCCC 


TTTGCTGTGA 


AGACAGCGGG 


TGTGAGGCGG 


GGGGTCTCGG 


TCCCCAAAGC 


89640 




CCCCGCAGGT 


GCAGCCCCCA 


CTCACCCTGT 


GAGGGCCGAG 


CAGGCCTGGG 


TCAGGCAATC 


89700 




GCGCCTGCCG 


ATCTTGCTGG 


TGCAGTCGCT 


GAAGGCCGTC 


TTGATGTCAG 


GGATTGAGAG 


89760 


25 


GCACGTCTGG 


GGGAGGTAAG 


GCCGTGAGGA 


GCAGCCCCCA 


CGTCTGGCCC 


TGTCCTGCCT 


89820 




GTGGGCCCGG 


GACTCTCAGA 


AGGGCGTATG 


CCCTTCACCC 


CAGGGAAACA 


GCCAGAGCTC 


89880 




CACCAGGGTC 


CCAGTGTCTC 


CCACAGAGAC 


CACAGCAGTG 


AGGACCCTGT 


GCTCAGCCCG 


89940 




AGGCTGAACA 


TGGCTGGTAG 


TGCCTGAGAC 


AAACTAGACG 


TCCACACGGC 


TCCAAGGAGT 


90000 




CCACCCCCCA 


TCCCCTCCCT 


GGGGGACACC 


CTGAGCCCCG 


AGGTGGGGCG 


CTGAGGACTG 


90060 


30 


AGGCCTCCTG 


GGCAGTGGCG 


GAGGCAGGTC 


CCAGGGGCCC 


ACACAGCCGG 


GGATGATGGA 


90120 




GAGGTGGGAG 


CCCTGCATCA 


GTGATGGGGG 


CAGTCTGCAG 


TCATGGTGGC 


TTCTGCTCAC 


90180 




AACCACCTGC 


CCAGTCTTCA 


AAAAGCAGCC 


CTCCCCTCCC 


CTTTTCCTCC 


GAGGGGAGAC 


90240 




CCCTGCCCCG 


TACCAGATGT 


CCCTCTTGTC 


GGCTGAGATT 


GTAGGGGAGG 


CCAGCCTTAC 


90300 




AGGCTGGGGG 


CAACAGAGCC 


ACCCCAGAGA 


AGGCAGGAAG 


TGAAGATTCA 


CCCGGCCCTC 


90360 
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TGGACGCCGG 


GCTGCTTCTG 


TGCAAAGCCA 


CTCCAAGAGA 


ACAGCTAGAA 


CTCAGCGTGG 


90420 




CCAGTGCTCC 


CGGGGGCAGT 


GGCACCTCAG 


AGGGGTCTTG 


AGGGGCTGCC 


CTGGGGGTGG 


90480 




GGCTGGCACA 


GATGCCACCT 


CCAAGGGTAG 


CAGGAACAGG 


TAAGGGTCAG 


AGCTGACTCC 


90540 




CACCAGGGCC 


CCAGCATCAC 


TTCTTTGAGC 


TCTGAGTTTC 


ACCTGGGTGT 


CCCCACAGCT 


90600 


5 


TGGCCACACA 


CTCCTGAGAC 


ACGGCCGCCC 


TCCTGGGGAG 


AGGTGCCCTG 


CATAGCAGGA 


90660 




AGAGGCCTCT 


GGGCGCCTGC 


CCTGAGGTGG 


GAGAACCTCC 


AGGGCTGGCA 


GCAGCAGGTC 


90720 




TGGAGAGGAA 


CCAAGCTTGG 


GAAGCTGCTG 


GGGGCAGGGC 


AGGCCTTGAG 


AATGGCTCTG 


90780 




TACCCCCTGG 


GCAGTCACTG 


GGCCTGGGGT 


GTCTGGGTGC 


ACACCTACTC 


CCCTTGCTGT 


90840 




GGGGGAGGCT 


GGGGACTCGG 


GAAGCTGCTG 


CGGGAGGCAG 


GGGTGGGGCT 


CACCTCCACA 


90900 


10 


TCCTGCTTGT 


TGGCCAGCAC 


CAAGACGGGG 


ACACCGCACA 


GCGCCTCGCT 


GGTCACCACC 


90960 




TTCTCTGGGG 


AGGGCAGGAG 


AGGCAGCGCC 


TCACACCCAG 


CATCCTGCCT 


CTGACTGCCC 


91020 




AGGGGCCCAC 


AGGCGTGGAC 


ACTGTGACAG 


CCACTCCCTC 


TGCCCCCCCC 


CCGTCACCCA 


91080 




CTAGGCAGGA 


GCACTTCTGA 


CCAGACACTG 


AGCCTGCCCC 


AGGCACAGAG 


CTGCCCAAGC 


91140 




TGGACCTGCC 


CCCACTCACC 


ATCCATCCCT 


CCCAGAGCAG 


CCAGGCCGCA 


CTCACCAAAC 


91200 


15 


GCCTGCTTGG 


ACTCAGCCAG 


CCTCTCCTCG 


TCGGTGGAGT 


CAATGACGTA 


GATGACGCCG 


91260 




TGACACTCCG 


CATAATACTG 


GGAGGAAGCA 


CCAGGAGTTG 


GGGCTCAGTC 


CCCACCCTGC 


91320 




CAAGGGCCAG 


CAGAGCCAGG 


CCTGTGTCAT 


GGCCACAGTG 


AGGGGCTCAC 


ATGAGGAAGG 


91380 




GGCAAGAGGG 


CAGCCCCCAA 


CTGCAAGACC 


CTTCTGGGAT 


GCATTCTGGG 


GTTGCGGGGA 


91440 




GATCTGGTGG 


AGGTGTCCCC 


AGACGCTGCT 


CCTGAGAACC 


TGCCGGCAAC 


CTTTGGCCTG 


91500 


20 


ATGGTGGCCA 


AAGGTGAAAG 


ACAGGGATTG 


GGCCAGGCGT 


GGTGGCTCAC 


ACTTATTATC 


91560 




CCAACACTTT 


GGGAGGCAGA 


AGCAGGAGGA 


TCACCTGAGC 


CCACTTCACG 


GCCAACCTGG 


91620 




GCAACACAGT 


GAGACTCCGT 


CTGTACAAAA 


GCTTATGGTA 


ATGTGCGCCT 


GCAGTCCTAG 


91680 




CTACTCGGGA 


GGCTGAGGTG 


GGAGGATGGC 


TTGAGCCTGG 


GAGGTTGAGG 


CTGTAGTGAG 


91740 




CTCTGATCAC 


ACCACTGCAC 


TCCAGCCTGG 


GTGAGAATGA 


GAGACCCTGT 


CTCAAAAAAA 


91800 


25 


AGATAGGGTT 


TGGGGGCTGG 


AGGAACCTAG 


ACCACAGCCT 


GGCCCGTTGA 


GGGAGTGCAC 


91860 




CTGTGGGGCT 


CTGTGCCAGC 


ACCTCGCACA 


GGGAGGGAGT 


GTGGCCATGC 


GGATAAGACT 


91920 




GACCAGCACC 


ATCTACGAAG 


CGAGCCTTCC 


CTGCCAGGAC 


AGGGCCAGAG 


TCACTGAGCT 


91980 




CAGACCTCTG 


CAGCCTGGGC 


TGGTCAGTCC 


TGGGCTCGCT 


GGCAACACTC 


CTGGGCAAGA 


92040 




CAGGGCACAG 


CCCCTGCAGC 


CTCAGGTACA 


AGTGCTGAGC 


CCTGGACCAG 


ATGAGTGCAC 


92100 


30 


CTCTATCTCA 


ATCAGAAAAA 


AACACAGCAA 


ACTCCGCGTC 


CACGTGGAGC 


AGACAACAGC 


92160 




TCACATTTGC 


CACTTTGCCT 


CCAGGCTGTG 


CCAGCTCTCC 


TGTCCAGGCA 


TGAGTGCCCA 


92220 




GAGACCTAGA 


ACTGGATGCT 


GACCAGGTAG 


GACAAGCTGG 


TGGTCAGTGT 


GTTAAGACAC 


92280 




ACACACCCGA 


GAGCATGAGA 


AGCCAGGAGG 


CACAGCCCAA 


CTCTCCGAAA 


TCCTTAGGGT 


92340 




GTCTGAGCAG 


GGAGTACCAG 


ACAACCCCAT 


CCCAGTGCCA 


GACAAGCTTG 


TGCACCTGCA 


92400 
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CTTCCCACAG AGGAGAGAAG 
CAAGGCCAGG CCCCCCCACC 
GCGCCCTCCA GGGTTCTGCA 
ACAGACAGGT CCCACACAGA 
5 TCTGCTGCTG ACACAAGGGA 
GCGGCTCTGT GGAGGCTGCA 
TCTGAGCAGG GTGGCAGAAG 
CCAGAGCCAC CCATCACACC 
ACGGGAGCGA CACCTCAGGG 

10 GGCGTCACAG CAGTGGTCAT 
CAGCGTGGCT CCTGGCTGGC 
GAGTTTTTCT CCCCAGCAGC 
AGACTAGTGT TACAGCAAAG 
ACACCAACCC AGGGTCCAGC 

15 GTGCTGGCTC TGCACAAGGA 
TCCGGTCCCT CCCAGTGACC 
AGCAGGGTCA GGGGTCCCGG 
GAGCCACAGC TGCCTCATGG 
GGAGATGGGG CTGCCATCCC 

20 CCACCACGGT GGTGTCTGGA 
TGGTAGTGAA TTGGGAGGGC 
GATGGGCTAC AGTGGCCCCC 
CAGGACCTTG GAGGAACCCC 
GTGGCCAGTG CAGGCATCTC 

25 ACCAGCAGAT GGTCTCATTT 
TGCCAGGTCA CACCTGCCCA 
GTCCCGGCCA GCCTGGCCCC 
CTGGGCCAAC CACAGCTCTG 
CCCTGCGTTC CTGGTGCCGA 

30 TGTGCCTGAG ACACCGGCCT 
CTGCGCACTG GACACACAGG 
AGGATCTATC AGAGATCACT 
AGGAGGGGAC ACTTTGGGGG 
ACCTGGGCAC TCAGCTCTAG 



CCTGTGCACC TGCACTTCCC 
CCCAGGAACT TCCCACAGTG 
GGTAGCGAGG CCCCCCCACC 
GGCCGCCAAC CCCTCAAGGG 
GCAGGCGGAC CCTAAGGTGG 
GCAAGCCCAG TGAGAGAATC 
GTTCATGTGC AACCGGGTCC 
TACCAGGCAC AAGGTGCACA 
CTGAGTGCGG GCAGGACCCA 
TGTGGGGTGC CCCACAAGGA 
CAGCTGACCC CAGTGGAGCA 
AATGGGAGCT CCCCAACTGC 
AACCCAGGAG CTGAGGTCCT 
CCCAGGACGA GGCCAATTCG 
TGCAGGATAC AGGAACCAGG 
TAGGGGGGTC CCTGCAGCTG 
CCACTAGAGC AGCACATACT 
GCATGACTTG CACCTCATGT 
ACAGCTGTCC CACAGCTGGG 
GAAGGGCTCA GACTGCCAGG 
ACTCAAGGAA GGGCTGTGCA 
TTAAGTCTCC CTCTTCCAGT 
TCTCCAACGT GGAAGTGTGG 
TGGCCCCACT GTATTCTTGC 
TGGTTTCTGT GGGACCCACT 
GGGCAGCCCA CACTGGGGAC 
ACATGGCACA GCATAGCCAG 
CGGATCCTGC TGCCTGAGTC 
CCCCCAGTGT GCATCCTGTA 
GTGACAAGAT CATCATCATC 
CCCTGACTCA ACTTGTCCTG 
TGGCCATGGG AGAAATGTCT 
CGCCTAGGAA AGGGGAACTA 
AGATGGCATC AGGGCAGCCA 



ACAGTGGAAA 


GGAGGAGGCC 


92460 


GAGAGGAGGC 


CCAAGGCCAG 


92520 


CCCAGGAACT 


TCTCTGGCCT 


92580 


ACCCTGCAGT 


GTGCCGGCTG 


92640 


AGACCTCTGT 


GGCAGGAGGG 


92700 


TCCACGTGGC 


TCCTGGGGCT 


92760 


TGGACCATGG 


GACCACGTGG 


92820 


GCCCAGCAGG 


GCCGCAGTGG 


92880 


GAGCCCCACG 


CCCCAGTGGA 


92940 


GGGGGAAGAG 


GGAGGTGTCC 


93000 


GTCAGAGGGA 


CTGTGGGTCT 


93060 


AAAGTGCCAG 


CCAGCCTGAG 


93120 


GGCACATGCC 


ACACATGTGG 


93180 


CAATGACGCC 


CCTTTCTGTG 


93240 


GTGGGAGCAG 


GGGCCTCCCT 


93300 


ATCCTCCCAG 


CTCTGAGCTC 


93360 


CAGCAGACAC 


GCTGAATGAC 


93420 


CTAGGAGACC 


CTGGTGGGCA 


93480 


GACCCAGGGA 


GCCACTGGCC 


93540 


AAGTCGCACC 


CCAGCAGAAG 


93600 


GCCCCAAGAC 


CAGCAGCAAG 


93660 


TTCGCCTTAA 


GAGAGGCCCT 


93720 


GTCCACATAG 


GGCTGCAGCT 


93780 


TTCATGTTGG 


AGAACACTGC 


93840 


TTGGCTGCAA 


AGAGCCACAC 


93900 


CCACCAGGCC 


ATGGTGTGAA 


93960 


TTCTCCTCCA 


GGGCTCCCTG 


94020 


GACCTCTCCT 


CTCCCGTCCT 


94080 


CCTCGACCTG 


TCTCAGCATC 


94140 


TGTGTCACTC 


CCCAAGCATG 


94200 


TCTGACTTCA 


GTGGTCCTAC 


94260 


TCTTGGCTAG 


AAGTCACAGC 


94320 


GGATCAAAAA 


AGAGATCAGG 


94380 


AGGCACTGGG 


GACACCCCAC 


94440 
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ACCCACTGTG 


CCAGCCTAGG 


GCAGGGAGCC 


CGAGGAAGCC 


ACAGGCTCTG 


CCCTGCTCAG 


94500 




TGCTGGACTC 


AGTGCCTGGC 


CCAGGCTGAG 


AAGGAGATAA 


ACTGCAGCCT 


TGGGGGTGTG 


94560 




GGGAAGGGGC 


ACCACACTGG 


GATCTCAGAA 


ATGCCCAAAA 


CCTGTGTCAA 


AATAGGAGAC 


94620 




TGCCGCTGTG 


AGACCCTGAG 


GAGTCTTCTG 


GTGATCATGG 


AAGAACAAAT 


GTTAAGCTAG 


94680 


5 


AACTGAAGGA 


ACCTCATCAG 


GGGAGAGGCA 


GCCATCCTGC 


CGTCCCCACA 


TCTGGTCTTT 


94740 




GCCATTTCTG 


TGTCCTGTGG 


TGGTCAGCAG 


CAAGGTCTCT 


GAGCCGAAAG 


GAGGCACTCA 


94800 




CTTTGGAGGA 


GTGCAGGGTC 


CCCAGGTCCC 


CACACTTTGT 


CTTGTCCTGA 


CTGAGAAAGA 


94860 




AACAGACTGC 


CCTGACCTCT 


CTGACTTGGC 


CAGCGAGGTT 


GCCCTTAGGC 


TCAAACCCAA 


94920 




GCCAGGGTTT 


GAACATTCCC 


AGACACTTGT 


AAGATGTTTA 


GGTTGTTAAC 


ATAATGTTCA 


94980 


10 


GGTTTCAAAA 


CATTGAAAGA 


AACTAGCCCC 


AGCCCTGAAC 


CCAGATCCCC 


CCCGGCTTCA 


95040 




GGCATGACCA 


GTGAACACGC 


CCTTCTCTCA 


CTGGTCACCT 


GAGGATGCCG 


CACTCTGTCA 


95100 




ACAGGTTCCC 


CTAATACATG 


CTCTGATCTG 


ATCGCCTTGG 


CATTTAGTGA 


TTCTTTCCCT 


95160 




GGAATTCTCC 


ACTGGCCCCA 


TCGCAGGGAA 


CTCCCAAGTG 


GGAAACTCCC 


CTACCACCAC 


95220 




TTTTGGGGCA 


ACTTCAGCTA 


AGGGTTCAGC 


TGGGACAAAA 


CAGGGAGCCA 


CTCGGGAACC 


95280 


15 


TGGGACAGGA 


CCAGAGAGAA 


AACCCGAGGG 


ACAGAGTGGG 


TAAGGAAAGC 


TGCTGAGGAA 


95340 




GGGCCCAAAG 


GGCACTCTGG 


AAAGAAGTGG 


CACTGGAGGG 


CTGGGGTGGG 


GGTGGTCCTG 


95400 




GCCAGGGAGT 


CTTACCTTGT 


CCCACAAAGA 


CTGCAGCTCT 


TCCTGCCCTC 


CTAAGTCCCA 


95460 




GAACATGAGC 


CGAGCCTTTC 


CCACATCCAC 


AGTGCCGACT 


GGGGAGAGGA 


GGAAACAGGC 


95520 




AAGGCTCATG 


ACCTTGGTCC 


TCGACACACC 


CAGTCCCAGC 


TCTCCCAGGG 


GATGGGGCAA 


95580 


20 


ACCATGCTGG 


TGCCACTCAA 


ATGAGACTTG 


AGAGGGGCCC 


GACAGGGCTG 


TGGCCACGGG 


95640 




CCAGCTGGAC 


TGTGAATATC 


ACGGCATCCT 


CAAGGCCCCA 


AACCCACAGC 


CTGCTATTGA 


95700 




GACCCTTACT 


GTTTAGGCCC 


ACGGTGGTGG 


TGATTTTGGA 


TAGACTCATC 


CCCTTGTAGT 


95760 




TCTTGTTAAA 


TCGGGTTTTC 


GACTGCTCCA 


GGAAGGTCTG 


AGGAGAGAGG 


CAGAGGCGAA 


95820 




ACACATCAAG 


GAGGGGCTAT 


ACTGGCTTCC 


AAATATCCTT 


ACTCAGGTCT 


GTTCTTTAAA 


95880 


25 


AGACAGAAAC 


AGAAACAGAG 


CAACACTCTG 


CTCTTCAGGA 


GGCTGGTGGT 


GACTATCCTG 


95940 




CCGTCTCAGG 


TGAAATTTGG 


CTTCCGTCTG 


GGTAGTGAAC 


GTGCAGCTGA 


CAGCACAAAA 


96000 




CCGAAGGGGG 


CGCCGCCAGG 


CCGTGGGAAA 


GGTGCGCGCA 


AGGGCGTGGG 


CACTCACCGT 


96060 




CTTCCCAGCA 


TTGTCCAGGC 


CCAGGATCAG 


GATGCAGTAC 


TCGTCCTTCT 


GAAACATGTA 


96120 




CTTGTACAAG 


CCCGACAGCA 


GCGTGTACAT 


CCTGCCCTGG 


GCACCCCAAC 


ATAGGTCAGT 


96180 


30 


GTGCAGCCAG 


AAAGCACCTC 


CCCTCCCCCG 


GGCTTCTCCA 


CGGTGGTCAG 


TGGCGCCCCA 


96240 




CGTCCAGCCG 


ACCGCTCAGG 


ACGAGAGCCT 


GGGGGCCATT 


CCCGACTCCT 


CGTCCCTCTC 


96300 




CCACCCCGTC 


CCTCTGTAAC 


TTCTCCCAGG 


TCAGCCGCCA 


CTGTGTCCTG 


CTCACAGCAA 


96360 




TGACTGCGAC 


CTCTCCGCAT 


ACACATCGGT 


TCCGGCCCCT 


CCCCTGCTCG 


CGGGACTACC 


96420 




CAGCCGGGTG 


TTCACAGTGA 


GCTCAGCCGC 


GCTCCCGCCC 


TCCCCCGAGG 


CTTCGCTCCC 


96480 
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ACGCTTCACG 


CGCGCGGAAC 


GGGGAACACA 


CTCGCTGCAG 


CCCCGCCTGG 


GCCACGGCAC 


96540 




CCTCGAGCGC 


CAGCCCCGCG 


CCCCACCCGG 


GAGCAGCGAG 


CCACCGGCGC 


GCTCCCCAGG 


96600 




AGCCCCTGCA 


GGCGCCGGGT 


AGGGACGCCC 


CATCACCCCA 


TTTCTTAAAA 


CGGGGACGGC 


96660 




CCTGGGGGGA 


GCGGACTACA 


GGGCGGGTGA 


GCAGCGGCGC 


GGCTGCTCCT 


GGAGTGCACC 


96720 


5 


TGGAGGCGGC 


GCGCGGCTGG 


CAGGGAACGA 


CTGCGAAGGA 


AGAACCTGGG 


TCGCGGCCCC 


96780 




CGGCTACGTC 


CGCCCCAAGC 


CGCCGCCGCC 


AGGTCTGAGG 


CTCCCCGACA 


AGCAGCCAAA 


96840 




GCTGGCTCCT 


GTCACACCCG 


CGTCCCACCT 


CGAGTCCTGG 


GCCGCCCCTC 


GGGCCTCGCG 


96900 




CCTCACCGCA 


CAGCCTGCGG 


CCTACCTGCG 


TCCGCCGCGC 


CCTCGGAGCC ' GCTGCTGCTG 


96960 




ACCCCCGCTG 


ACCTCCGCTG 


ACCCCGCGCT 


AACCCCGCGC 


GGCGCCTGAC 


GGGACGCGGG 


97020 


10 


CCGGCCTCAG 


GGAATGAGCT 


GAACCGCGTC 


CCAGCGGCCT 


CCGCGCTCCG 


CTTCCCGGCT 


97080 




GCCCCCGCGC 


GCCAAGCACT 


TCCGGAAGCG 


GCGGCGCTCG 


GGAGGAAGTG 


CCGATCGGCT 


97140 




GCTGGGGCGA 


AAAGGGGGCG 


CCGGGCCGCT 


CTAGCCGGTG 


AGGCCGGCGG 


GCTCTCTGTG 


97200 




GCTGCGGCTG 


GGAAACCGCG 


CGGAGGAGGT 


GCCCGGCCGG 


GGACCAGGTG 


GCCGCGGTTT 


97260 




GCGGGGACGC 


GGCCCTGGCC 


AGACAGAAGA 


GACGCCGGGC 


GGGGGGGCGC 


GGCCGGCCTG 


97320 


15 


GAAGGCGGCG 


GGCGCGGCGG 


GTGGGCTCGG 


CGGAGGGTGA 


GGCGGCGGGG 


CGCCCCGCGG 


97380 




GGAAGGGGCT 


CCGGAGTGAC 


GCGGGACCCG 


GCTAGCGGCG 


AGCCCACGGC 


GGCTCGGAAG 


97440 




GGAAGCGCGG 


AGCCTGAGCG 


GGGGTACCCG 


GGCTGCGACC 


TCTGCGCTGG 


GAGCTGTGCC 


97500 




TCTGAGCCGG 


TGTCTCCCCG 


AGGGAAAGGG 


GACGTGCCCG 


TGCCCGTGCC 


CGCCCTCAGG 


97560 




CTGTGGGGTC 


GGTCCCGAGA 


CGCGGGGCTC 


AGCTGGCTTC 


TCTTCTTGCA 


GCCCTGGTCC 


97620 


20 


AGCGCCTCCC 


TCTCTCAGCA 


TGGACGAGGA 


GAGCCTGGAG 


TCGGCCTTGC 


AGACCTACCG 


97680 




TGCGCAGCTG 


CAGCAGGTGG 


AGCTGGCCTT 


GGGCGCCGGC 


CTGGATTCGT 


CTGAGCAGGC 


97740 




TGACCTGCGC 


CAGCTGCAGG 


GGGACCTGAA 


GGAGCTCATC 


GAGCTCACCG 


AGGCCAGCCT 


97800 




GGTGTCTGTC 


AGGAAGAGCA 


GGTTGTTGGC 


CGCGCTGGAC 


GAAGAGCGCC 


CGGGCCGCCA 


97860 




GGAAGATGCT 


GAGTACCAGG 


CTTTCCGGGA 


GGCCATCACT 


GAGGCGGTGG 


AGGCACCAGC 


97920 


25 


AGCGGCCCGT 


GGGTCCGGAT 


CAGAGACCGT 


TCCTAAAGCA 


GAGGCGGGGC 


CAGAATCTGC 


97980 




GGCAGGTGGG 


CAGGAGGAGG 


AAGAGGGAGA 


GGACGAGGAA 


GAGCTGAGTG 


GGACAAAGGT 


98040 




GAGCGCGCCC 


TACTACAGCT 


CCTGGGGCAC 


TCTGGAGTAT 


CACAACGCCA 


TGGTGGTGGG 


98100 




AACGGAAGAG 


GCGGAGGATG 


GCTCGGCGGG 


TGTCCGTGTG 


CTTTACCTGT 


ACCCCACTCA 


98160 




CAAGTCTCTG 


AAGCCGTGCC 


CGTTCTTCCT 


GGAGGGAAAG 


TGCCGCTTTA 


AGGAGAACTG 


98220 


30 


CAGGTAAAGC 


CCTTTGTTGT 


CAGATGCCAA 


CCTTAGGGGC 


GTAAGGGGCA 


CGCACACAGG 


98280 




GTCGGGTCAG 


GATCGGCCCT 


CCCTTTGCTT 


TGCAGTTTTG 


TCTCAGCTTC 


CTGGGGCAGG 


98340 




CGTGCTTTGA 


CAGCTGTGTC 


TGTGTTCAGG 


CGTCTACGTC 


TTCCTTCTGG 


GGTGAATCAA 


98400 




GAAGCATGGA 


AGGAGGCCAG 


GCGCGGTGGC 


TCACGCCTGT 


AATCCCAGCA 


CTTTAGGAAG 


98460 




CCGAGGCGGG 


CAGATCACCT 


GAGGTCAGGA 


GTTCAAGACC 


ACGCTGGTCA 


ACATGGTGAA 


98520 
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ACCCCATCTC 


CTTAAAAACA 


CAAAAATGAA 


CCGGTCGTGG 


TGGCGCGCAC 


CTGTGGTCCT 


98580 




GGCTACTCAG 


GAGGCTGAGG 


CAGGAGAATT 


GGTTGAACCC 


AGGAGGCCGA 


GTTTGCAGTG 


98640 




AGTGGAGATG 


CAGCCACTGT 


ACTGCAGCCC 


GAGCAGCAGT 


GCAAGGCTTA 


TGTGGAAGAG 


98700 




AGTAGGTCTC 


CAGCCTATCG 


TCAGTTTTTT 


TTTGGTGGTT 


GTTTTAATTT 


TTTTTGAGAC 


98760 


5 


AGGGTCTTAC 


TTTGTCAACC 


AGGCTGGAGT 


GCAGTGGCAT 


AGTCCTGGCT 


CACTGCAGCC 


98820 




TGGACCTCCT 


GGGCTCAACC 


GATCCTCCTG 


CCTCAGCCCC 


CCTAGGAGCT 


GGGCTACAGA 


98880 




CTCACGCTAC 


TACACCCAGC 


TAATTTTTAT 


ATTACTATAA 


TTTTTTATCT 


TTTTTTTGAG 


98940 




ACGGAGTCTT 


GTTCTGTTGC 


CCAGGCTGGA 


GTGCAGTGGC 


GTGATCTCGG 


CTCACTGCAA 


99000 




GCTCCGCCTC 


CCGGGTTCAC 


GCCATTCTCC 


TGCCTCAGCC 


TCCCGAGTAG 


CTGGGACTAC 


99060 


10 


AGGCGCCCGC 


CACCATGTCT 


GGCTAATTTT 


CTGTATTTTT 


AGTAGAGACG 


GGGTTTCACC 


99120 




ATGTTAGCCA 


GGATGGTCTC 


AATCTCCTGA 


CCTCGTGATC 


CGCCCACCTT 


GGCCTCCCAA 


99180 




AGTGCTGGGA 


TGACAAGCGT 


GAGCCACCGC 


GCCTGGCCTT 


TTTTTTTTGG 


AGACAGAGTT 


99240 




TCACTCTCCT 


CACCCAGGCT 


GGAGTGTAGT 


GGCGCAATCT 


CAGCTTACCG 


CAACCTCTGT 


99300 




CTCCCGGGTT 


GAAGTAATTC 


TCTACCTCAG 


CGTCCAGAGT 


AGCTGGCATT 


ACAGGCGCCC 


99360 


15 


GCCACCACAC 


TCGGCTAATT 


TTTTGTATTT 


TTAGTAGAGT 


CGGAGATTCA 


CCATCTTGGC 


99420 




CAGGCTGGTC 


TTGAACTCCT 


GACCTCGTGA 


TCCACCCACC 


TTGGCCTCCC 


AAAGTGCTGG 


99480 




GATCACAGGC 


GTGAGCCACT 


GCGCCTGGCC 


CTGTTGTTAG 


TTTTATTCTC 


TAGAGTTCAA 


99540 




CTTTTAAATT 


TTACTTTCAT 


GGAGATTTTC 


AAACATACCC 


CAAATTAGAG 


AGTTTAGCAT 


99600 




AATCACCGCC 


CACGGTCCAT 


CATCCAATGT 


CGTCATTTAT 


TAATATTTTC 


CCAGTCTCAT 


99660 


20 


TTTGTCTGTT 


CTCCCTGCCC 


TATTTTTTTC 


TTTCCTGGGC 


CATTTTAAAG 


CAAATTCCAG 


99720 




AAGTTACTGG 


TTTTTTCCAA 


TTATGAATAC 


TTCATAGTTG 


CATCTCTAAT 


CTAACTGATT 


99780 




AGGAAATTAC 


TTAAAAAGTA 


ACTTTTTGGA 


AGTCCAAGTC 


CGATGTGAGG 


ACAAAAAAGA 


99840 




GTAACTTCTG 


TGTCATAATA 


GGTAACACAT 


TTAATGGTAA 


TACCTCTTCC 


ATATTCAAAT 


99900 




ATGAACAATT 


ATTACTGTAA 


TGTCTCTATT 


TCCCTAAGCG 


CATAGCTTTA 


TTTTTCCTCC 


99960 


25 


TTTTTACTTT 


TCTCTTAGAA 


GAAATATTTA 


CCAAGCCTTC 


TAGTAGGTAA 


TTTTCTTTTT 


100020 




TAGCCAATAG 


TTCAGGCTGA 


CCGTGTAACC 


ATCCCTAGTT 


CTAGTTCTAG 


TTCTTTGAAT 


100080 




GTCTTCCTTT 




TTGAAACAGC 


GTCTTGCTGC 


TCTGTCACCC 


AGGCTGGAGT 


100140 




GCAGTGGCAC 


AATCTCGGCT 


CACTGCAATC 


TCCGCCTCCC 


TGGCCCAAGC 


CATCCTCCCA 


100200 




CCTCAGCCTC 


CCTAATAGCT 


GATACTACAA 


GTGTGCACTG 


CCACGCCCAG 


CTAATTTTTG 


100260 


30 


TATTTTTTGT 


AGAGACGGGA 


TTTCACCATA 


TTACCCAGGT 


CTCGAATTCC 


TGATCCCTTT 


100320 




GATGAGAGAT 


CTGACACATC 


CCTGTGGTGC 


TCCCTCTGGA 


CCAGGCACTG 


CTCCAAGGGT 


100380 




TTCATATACT 


TTCATTCATC 


TGTGCAACAG 


CCCTGTAGGT 


AGGCCCTGCA 


GTCACACCAT 


100440 




CTGACAGAGG 


AGGAAACAGG 


AGTAGAAGAA 


CTGAGTGGTC 


CAGGGCTTCA 


AGGCTCAGAG 


100500 




GGCTCCAGTT 


GCCCCCAGCC 


CTCGTTCCGT 


CCCCTGCTCC 


ACCCAGTGCT 


GCTTGCCATG 


100560 
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TCGGCATCAG GCCTGATCTG 
ATTCAGGACT GATAAGTTCT 
AGTTTCACTG TTGTTGCCCA 
AGGGATTCTC CTGCCTCAGC 
5 GTGCCCAGCT AATTTTTATA 
GTCTCGAACC CTTGACCTCA 
GCATGAGCCA CCACACCCGG 
AAGTATATAG TTTGCAGATA 
TGTCCTTTGA TGCACAAACA 

10 GCCTATGTTT TTGTCATCAG 
CCTATGTTTT CTTCTAAGAG 
TTTGAGTTAA TTTTTGTATA 
ATTTCCAGTT TTCCCAGCAC 
TGGCACTTTT GTCCAAAATC 

15 TCCTGTTCCA TTGGTGTATA 
ATAAGTTTTA AACTCAGGAA 
TTTTGTTTCT GGATCCTTTG 
CATAAAAGAC AGTTTGAATT 
ACATTGTCAT CTTTACAATA 

20 CCCAGTACTT TGGGAGGCTG 
CTGGCCAACG TGGTGAAACC 
GTGCACACCT GTAATCCCAG 
GAGGAGGAGG TTGCAGTGAG 
GAGACTCCAT CTTAAACAAC 

25 TCATGAATGA AGGATCTGTC 
ACTGTTCAAG TCTTGCACAT 
AATTGTTTTT CTTTTCCTTT 
AGTGCAGTGG CACAATCTTG 
CTGCTCAGCC TCCCAAGTAG 

30 TTTTTTTGAG ATGGAGTCTT 
CTCACTGCAA GCTCCACCTC 
CTGGGAATAC AGGTGCCTGC 
GGGTTTCACC ATGTAGCCAG 
CGGCCTCCCA AAGTGCTGGG 



AAAGCTTCCG GAGCATCTTA 
CTTGGATTTG CGTTGGACCT 
GGCTAGAGTA CAATGGCACG 
CTCCCAAGTA GCTGGGATTA 
TTTTTAGTAG AGGCAGGGTT 
GGTGATCCCG CCTTGGTTTC 
CCCAGGATTT CTTTATATAT 
TTTGCTCCCA TTGTTTGGGT 
TTTTAAATTT TGATGCAGTG 
GTTTAAGAAA CCACCTCATC 
TTCTGTAGTT TTAGCTGTTA 
TGTTATTAGG TGAGGGTCCA 
CATTTGTTTA AAAGACTGCT 
AATTGGCAAT ATATGTAAGG 
TGTGCCAGTA CCACACTGTT 
GTGGTAGTTA TTCACCATTT 
CAATTTCATA TGAATTTTAG 
TTGATATGGA TTGCATAGAA 
TTAAGCCTTC TGGCTGGGTG 
AGGCGGGCAT ATCACTTGAG 
CCGTCTCTAC TAAAAATAAA 
CTACAGGAGA GGGTGAGGCA 
CTGAGATCAT GCCACTGCAC 
AACAATAACA GAAGAAAAAA 
CATTTATTTA CGTCTTTAAT 
TCTTGGTTAA ATAAGTATTA 
TTTTTTTTGA GACAGAGTCT 
GCTCACTGCA ACCTCTGCCT 
CTGGGATCAC AGGTGCCTGC 
GCTCTGTTGC CCAGGCTGGA 
CCGGGTTCAC ACCATTCTTC 
CACCACGCCC AGCTAATTTT 
GATGGTCTCG AACTCTTGAC 
ATTACAGATG TGAGCCACTG 



CAGACGTCCA 


CCTTGCCACC 


100620 


TTTTTTTTTT 


TTTAAGATGG 


100680 


ACCTCCACCT 


CCTGGGTTCA 


100740 


CAGGCGCCTG 


TCACCACGTG 


100800 


TCACCGTGTT 


GGCCAGGCTG 


100860 


CCAAAGTGCT 


GGGATTACAG 


100920 


TCTGGATATC 


ATCCCTTATG 


100980 


TGTCTTTTCA 


CTTGATATAG 


101040 


CAATTTATTG 


TTTCTTTATT 


101100 


CATAGTTATG 


AGGATTTTCA 


101160 


AATTTAGGTC 


TTTGATCCAT 


101220 


CTTTATTCTT 


TTGCATGTGG 


101280 


TTTTCTCCAC 


TGAATGGTCT 


101340 


GTTTATTTCT 


GAGCTCTCTC 


101400 


CTGATTATTA 


TAGCTTTGTG 


101460 


GCTCCTCTTT 


TTCAAGTTTG 


101520 


GATCGGCTTG 


TCCAATTCTG 


101580 


TGTGTAGATC 


TGTTTGGGGC 


101640 


TGGTGGCTGA 


CGCCTGTAAT 


101700 


GTCAGGAGTT 


CAAGACCAGC 


101760 


AAACAAATTA 


GTCGGAGGTG 


101820 


GGAGAATCGC 


TTGAACCTGG 


101880 


TCCAGCCTGG 


GTAACAGAGG 


101940 


ACAGTATTAA 


GTCTTCCAAT 


102000 


TTCTTTCAAC 


AGTATTTTGT 


102060 


TTTTTGATGC 


TTCTCTAAGG 


102120 


TGCTCTGTCA 


CCCAGGCTGG 


102180 


CCCGGGTTCA 


AGCAATTCTT 


102240 


CACCACACCC 


AGCTAATTTT 


102300 


GTGAAGTGGC 


CCAATCTTGG 


102360 


CGCCTCAGCC 


TCCTGAGTCG 


102420 


TTGTATTTTT 


AGTAGAGATG 


102480 


CTCAGGTGAT 


CTGCCTGCCT 


102540 


TGCCCGGCTC 


GAGTTGTTTT 


102600 
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CCTTAGTTAC 


ATTTTCAGGC 


TGTTTGTTGC 


TAGTATATAG 


AAATACAAGC 


TGGGCACCGT 


102660 




GGCTCACGCC 


TGTAATCCCA 


GCACTTTGGG 


AGGCCAAGGC 


GGGTGGATCA 


CCTGTGGTCA 


102720 




GGAGTTCGAG 


ACCAGCCTGG 


CCAACATGGT 


GAAATCCAGC 


CTCTATTAAA 


AATACAAAAA 


102780 




TTAGTCTGGC 


ATGGTGGCAG 


GTGCCTGTAA 


TCCCATCTAC 


TCAGGAGGCT 


GAGGCAAGAG 


102840 


5 


AATTGCTTGA 


ACCTGGGAGG 


CGGAGGTTGC 


AGTGAGCTGA 


GATCGCGCCA 


TTGCACTCCA 


102900 




GCTTGGGGAA 


CAAGAGTGAG 


ACTTCATCTC 


AAAAAAAAAA 


AAAAAGAAAT 


ACAGTGGATT 


102960 




TTTTTATGTT 


AATCCTGTAT 


TGATTGCTGA 


ATTGGTTTAT 


TAGTGCTAAT 


AGGATTTTTT 


103020 




ATGCACTATT 


TAGGATTTTC 


GATATATACA 


ATCATATATA 


TTCAATATAT 


ACAATTAATA 


103080 




TATATGTGAA 


TAGAGATAAT 


TGTAGTCTTT 


GTTTCTAGTT 


TGCATGGCAT 


TTATTTCTTT 


103140 


10 


TTCTTGCTTA 


ACTGCCTTAG 


CTAGAACTTC 


AAGTACGATG 


TTGAATAAAA 


GTGACTAGAG 


103200 




CGGGCCGGGG 


GTGGTGGCTC 


ACACCTGTGT 


TCCCAGCACT 


TTGGGAGGTG 


GAAGTGGGCA 


103260 




GATCACTTGA 


GATCAGCAGT 


TTGAGACCAG 


CCTGGCCAAC 


ACGGCGAAAC 


CCCATCTCTA 


103320 




CTAAAAATAC 


AAAAATTAGC 


TGGGTGAGGT 


GATGTGCACC 


TGTAGTCCCA 


GCTACTTGAG 


103380 




AGGGTGAGAC 


ATGAGAATTG 


CTTGAACCTG 


GGGGGCGGAG 


GTTGCAGTGA 


GCCAAGATCA 


103440 


15 


TGCCACTCCA 


CTCCAGCCTG 


GACGACAGAG 


CAAGAACCCT 


GTCTTTAAAA 


AAAAAAAAAA 


103500 




AAAAGTGGCT 


AGAACAAACA 


TCTTTATCTT 


GTTCCTGATC 


TTAGGTGGAA 


AACTTTTTTG 


103560 




TTCCTGATAT 


TAGGTGGAAA 


ACTTTTAGTC 


TTTCACTGTT 


GAATATGATG 


TTACTTGTAG 


103620 




GTTTTCTGTA 


GATTCCCTTT 


ATCGAGTTGA 


GGAAATTCTC 


TTATATTCAT 


AGTGTGTTGA 


103680 




GTGTTTTTTA 


TCATGAAAGG 


GTGTTGATTT 


TTTTTTTAAA 


GATAGGGTCT 


TGTTCTGTCA 


103740 


20 


CCCAGGCTGG 


AGGGCAGTGG 


CATGATCATG 


GCTCACTGCA 


ACCTCGAATT 


CCTGGGCTCA 


103800 




GGGGATCCTC 


CTACTTCATC 


CTCCTGAGTA 


GGTGAGACTA 


CAGGCATGAG 


CCACCATGCC 


103860 




CAGCTAATTT 


TTTAATTTTT 


CTGTAGAGGT 


AGGGTCCTGC 


TTTGCTGCCC 


AGGCTGGTCT 


103920 




TAAACTCCAG 


GGCTCAAGCA 


ATCCTGCCTC 


AGCCTCCCAA 


AGTGCTGAGA 


TTACAGGGGT 


103980 




GAGTCACTGC 


ACTGCACCCA 


GCTGTGTGGG 


ATTTTTCAAA 


TGCTTTTTTC 


CTTTAGATGA 


104040 


25 


TCATGTGTGG 


TTTTTTTCCT 


TTCATTTTGT 


TAATGTGGTA 


TATTGATTTT 


CGTATGTTGA 


104100 




ACCATCCTTG 


AATTCCTCAG 


ATAAAGCACG 


CATATTCATG 


GCGTATTATC 


TCTTTATTAT 


104160 




TATTTTTTTT 


GTAGAGATGA 


GATTTCACTC 


TGTTGCCCAA 


GCTGGTCTCA 


AACTCCTGGG 


104220 




CTAAAGTGAT 


CCTCCTGCCT 


CAGCCTCCGA 


AAGCGCTGGG 


ATTATAGGCA 


TGAGCCACTT 


104280 




GGCCCTATCT 


TTTTTCTTTT 


TCTTTTTTTT 


TTTTTTTTGA 


GACAGAGTCT 


CACTCTGTCG 


104340 


30 


CCGGGCTGGA 


GTGAGTGGCG 


CGATCTCGGC 


TCACTGCAAC 


CTCCATCTCC 


CGGGTTCAAG 


104400 




CAATTCTCCT 


GCCTCAGCCT 


CCTGAGTAGC 


TGGGACTACA 


GGTGCCCGCC 


ACTATGCCCA 


104460 




GCTAATTTTT 


TGTGTTTTTA 


GTTGAGACGG 


TGTTTTGCCA 


TGTTGGACAG 


GCTGGTCTTG 


104520 




CACTCCTGAC 


CTCGTGATTC 


ACCCACCTTG 


GCCTCCCGAA 


GTGCTGGGAT 


TACAGGCATG 


104580 




AGCCACCGCA 


GCGAGCCTTA 


TCTTTTTAAC 


AGTTAAAAGT 


TTAAGGCCTT 


ATCATGTAAT 


104640 
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AACATTGCTG 


GATTTGATTT 


GCTGCTGTTT 


TGTTGAGAAT 


ATTTGCATCT 


GTATTGATAA 


104700 


GGGATATTGG 


TCTGTAGTTT 


TCTTTTCTTG 


GCATGTCTTT 


GTATAGCTTT 


GATGCCAGCA 


104760 


TAATATTGGC 


CTCATAGAAT 


GAGTTAGGAA 


GTATTCTTTA 


TATTATGGGA 


AGAGGTAAAA 


104820 


AGGGATTGGT 


GTTAATTCTT 


CTTCAAATGT 


TTGATAGAAT 


TCAACAGTGA 


AGTGATATAT 


104880 


ACAATCATAT 


ATATAGAGAG 


AGAGAGAGAG 


AGAGATGGAC 


TTTTCTTTTG 


TTGGAAGTTT 


104940 


ATTGACTATT 


GATTCAATTT 


CCTTATTGAA 


ATTGACTTTT 


CTTTTTGGAA 


GCTAAAATGT 


105000 


ATAACTGTAG 


TGAAAGTTTC 


TGAACTTTTC 


TTTCATTGGA 


AGTTTTTTGA 


CTACTGATTC 


105060 


TTTATTTGTT 


ATAGGTCTAT 


TCAGATTTTC 


TGTTTCTTCT 


TGAGTCAGTT 


TGGTCTCGCT 


105120 


CTGTCGCCCA 


GGCTGGAGTG 


CAGTGGTGCC 


ATCTTGGCTC 


ACTGCAACTT 


CTACCTCCCG 


105180 


AGTTCAAGTG 


ATTCTCCCAC 


CTCAGCCTCC 


CCAGTATCTC 


GGACTACAGG 


CGCACGCCAG 


105240 


CATACCTGGC 


TAATTTTTGT 


ATTTTTAGTA GGAACAGCAT 


TTCACCATGT 


TGGCCAGGCT 


105300 


GGTCTCGAAC 


TCCTGACCTC 


AGGTGATCCA CCCGCCTCGG 


CCTCACAAAG 


TGCTGGGACT 


105360 


ACAGACATAA 


GCCACCGCGT 


CCAGCCTTGA 


GTCAGTTTAG 


ATAGTTTGCA 


TGCATGTTTC 


105420 


TAGGAATTTG 


TCCATTTTGT 


TTATGTTATC 


TAATCTGTTA 


CCATACAATT 


GTTCATAGTA 


105480 


TCCTTTTATA 


GCCCTAGTTA 


TTTCTGTAAG 


ATCAGTAGTA 


ATAGCTCCAC 


TTTCTCTCTT 


105540 


GGTTTTAGCA 


ATTTGAGTCA 


TCTCTTTTCT 


TCTTCTTTTT 


TTTTTTTTGA 


GATGGAGTCT 


105600 


CACTGTGTCA 


CCCAGGCTGG 


AGTGCAGTGG 


CATGATCTTG 


GCTCACTGCA 


ACCCCTGCCT 


105660 


CCCAGGTTCA 


AGCAATTCTG 


CCTTAGCCTC 


CTGAGTAGCT 


GGGATTACAG 


GTGTGAGCCA 


105720 


CCACACCCAG 


CTAGTTTTGT 


TTTGTTTTTT 


TGTTTTTGAG 


ACGGAGTCTG 


TTTCTGTCTC 


105780 


CCAGGCTGGA 


GTGCAGTGGT 


GCAATCTCAC 


TCATTGCAAC 


CTCCGACTCC 


CAGATTCCAG 


105840 


CAATTCTCCT 


GCCTCAGCCT 


CCCGAGTAGC 


TGGAACTATA 


GGCGTGCACC 


ACCACGCCTG 


105900 


GCTGATTTTT 


ATATTTTTAG 


TAGAGATGGG 


ATTTCACCAT 


GTTGGCCAGG 


CTGGTCTTGG 


105960 


ACTCCCTACC 


TGAGGTGATC 


CGCCCACCTT 


GGCCTCCCAA 


AGTGCTGGGA 


TTATAGGCAT 


106020 


GAGCCACCAT 


GCCCAGCCAG 


TTTTTGTATT 


TTTAGTAGAG 


ATGGGGTTTC 


TCCCTGTCGG 


106080 


CCAGGCTGGT 


CTTGAAATCC 


TGACCTCAGG 


TTATCCACCA 


GCCTTGGCCT 


CCCAAAGTGC 


106140 


TAGGATTACA 


GGCATGAGCC 


ACCACGCATG 


GCCTGTCTTT 


TCTTCTTGGT 


CATTTTCGCT 


106200 


AAAGGTTTGT 


CAATTTTGTT 


GATCTTTTTT 


GTTGCTGATC 


TCTATTGTTT 


TCCCATTCTG 


106260 


TTTCATTTAT 


TTCCATTTTA 


ACCTTTGTTT 


CCTTTTTTCT 


GCTGGTTTGG 


GTTTAATTTG 


106320 


CTCTTTTTTT 


CCCCTAATTT 


TTCAAGGTAT 


ACAGTTAAGT 


TATTGATTTG 


AGATCTCTTT 


106380 


TTTCTTTTCT 


TTTTTTTTTT 


TTTTTTTTTT 


TTTGGTTGCT 


GTTGAGATGG 


AGTCTCCCTC 


106440 


TGTCACCCAG 


ACTGGAGTGC 


AGTGGCATGA 


TCTCAGCTCA 


CTGCAGCCTC 


CGCCGCCCAG 


106500 


GCGATTCTCC 


TGCCTCAGCC 


TCCTGAGTAG 


ACGTTTCCCG 


GCCAAGGTGT 


TTCTTTTTGA 


106560 


ATGTAAGCAT 


TTACAGCTAC 


AGATTTCCCT 


CTAAACACTG 


CTTTCACTGC 


ATTCCATAAG 


106620 


ATTGTTTTTT 


GTTGTTTTTT 


GTTGTTGTTT 


TGTTGTTTGA GACACAGTCT 


CACTCTGTTG 


106680 
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CCGTTTGGAG AGCAGCGATG CGATCATAGC 
GTCCTCCTGC CTCAGCCTCC CAAGTAGCTG 
TAATTTCTTT TATAAGTTTT TGCAGAGGCC 
CACTTTGGGA GGCCAAGGTG GGTGGATCAC 
5 CGACAGGGAG AAACCCCATC TCTACTAAAA 
TGCCTGTAAT CCCAGCTACT CAGGAGGCTG 
AGAGGTTGCA GTGAGCCAGG ATCACACCAT 
CTCCATCTCA AGAAAAGAAA AAAAAAAGTT 
CAGGCTGGTC TCAAACTCCT GACTTGAAGG 

10 GAGATTATGG GCAAGAGCCA CCGCACCCTG 
CATTTTCATT GATCTCAAGA CATCCTAATC 
TTATTCAAGA GTGTCTTTAT TTCTGCATAT 
TTTTTTTTTT GAGAAAGGGT CTTGCTCTGT 
CTTGCCTCAC TGCAACCTCT GCCTCCCGGG 

15 AGTAGCTGGG ATTACAGGCA CACACCACCA 
ACGTGCTGGT CAGACTGGTC TCGAATTCCT 
CCAAAGCACT GGGATTACAG GCGTGAAACA 
ATAGAGAGAA GGTCTCACTC AAGCCCAGGC 
TCCCTCCTCG GCCTCCCAAG GTGCTGAGAT 

20 TTTCAAGACT TTAAAAATGC CATCTTGGCT 
CACTTTGGGA GGCCGAGGTG GGCAGATCAC 
ACATGGTGAA ACCCTGTCTC TACTAAAAAT 
GCCTGTAGTC CCAGCTACTC GGGAAGCTGA 
GAGCTTGCAG TGAGCTGAGA TCACACCACT 

25 CCGTCTCAAA AAAAAAAAAA AAAATGTCAT 
TGAGAAATTG GCTGTTAATC TTATTGAGGA 
TTGCTGTTTT AGGAGATTCT CTATCTTTGG 
GTGGATCCCT CAATTTATAA GCTACTTGGA 
ATGTCTTTCA TTAAATTTGC AAAGTTTCAG 

30 TTTGTTTCTT TCTGTCTGTT TGCCGCTTAT 
TGGTGTAGCA CAGGTCCCTT GGGCTCTAGG 
TCCTCATTTT GGATAAATTC AGCTGACCTG 
TTCTCAAATC TGCTGTTGAA ACTTCTGGTG 
GCTCCAAAGT TTCTATTTGG TTTCTTTCTG 
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TCTGTAGCCT 


TGAGCTCCTG 


GACTCAATCA 


106740 


GGACTACAGG 


TGTACACCAC 


TGCACCTAAC 


106800 


AGGCACAGTG 


GCTCACACCT 


GTAATCCCAG 


106860 


CTAAGGTCAG 


GAGTTCGAGA 


CCAGCCTGGC 


106920 


ATACAAAAAT 


TAGCTGGGCG 


TGGTGGCAGG 


106980 


AGGCAGGAGA 


ATCGCTTGAA CCTGGGAGGC 


107040 


TGCACTCCAG 


CCTGGGTAAC 


AAAAGCAAAA 


107100 


TTTGCAGAGA 


CAGGGTATCA 


CTTTGTTGCC 


107160 


AGTCCTACTG 


CCTCAGCCTC 


CCAAAGTGCT 


107220 


CCACTTGGCT 


GTTTTGTTCT 


GTTGTATTTC 


107280 


TCCCTTTTGT 


TTTTTTGTTC 


GACTTACTGG 


107340 


TTGTAAATTT 


TCCAAAAAAG 


TTTTTCTTTC 


107400 


CGCCCAGGCT 


GGAGAATGGT 


GGTGCACAAT 


107460 


TTCAAGTGAT 


CCTCCCACCT 


CAGCCTTCCC 


107520 


CACCTGGCTA 


ATTTTTGTAT 


TTTAGTCTTA 


107580 


GACCTCAGGT 


GATCTGCCCG 


CCTTGGCCTC 


107640 


CCATGCCCAG 


CCCCCAATTT 


TTTTTTTTTA 


107700 


TGGTCTTGAA 


CTCCTGAGCT 


CAAGCTGTCA 


107760 


TACAGGTGTG 


AGTCACAGTA 


CCTGGCCTTC 


107820 


GGGCACGGTG 


GCTCACGCCT 


GTAATCCCAG 


107880 


GAGGTCAGGA 


GATCAAGACC 


ACCCTGGCTA 


107940 


ACAAAAAATT 


AACCAGGTGT 


GGTGGCAGGT 


108000 


AGCAGGAGAA 


TGGCGTGAAC 


CCGGGAGGTG 


108060 


GTACTCCAGC 


CTGGGCAACA 


GTGCGAGACT 


108120 


CTCACTGCCT 


TCTGGTCCAA 


TAGTTTCTGA 


108180 


ACATTTATAT 


ATTGACTAGT 


CACTTGTCTC 


108240 


GTTTCAGCAG 


TTTGATTATA ATGTATCAGT 


108300 


GTTCATTGGA 


CTTCTTGGAT 


GTGTAAATTC 


108360 


CTACTATTCT 


TTGCATCTTG 


AAATACTAGT 


108420 


GGAACTTTAT 


GCATACATTG 


ATGTGCTTCA 


108480 


CATTTTTCTT 


TGTTCTTTTT 


TTCTTTCTGC 


108540 


TCCTCAAGTT 


CACTGTTTCT 


TTCTTCTTCC 


108600 


AAATTTTCAC 


TACAGTTACT 


GTACTTTTTA 


108660 


TAGTAATTAT 


CACTTTACTA 


GTATTCTCTA 


108720 
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TTTGGTTAGA 


CATGGTTCTT 


TTGTTTTCCT 


TTAGTTCATT 


ATCCATGGTT 


TCCTTTATTT 


108780 




TTAAATTTCT 


TTTTATTTAG 


TTATTAATTT 


IT TTTT T, 1 *n*T 


TGAAGCGGGG 


TTTCACTCTT 


108840 




GTCACCCAGG 


CTGGCAGGCA 


ACGTCACAAT 


CTTGGCTCAC 


TACAACCTCC 


GCCTCCTGGG 


108900 




TTCAAGTGAT 


TCTCCTGCCT 


CAGCCTCCCA 


AGTAGCTGGG 


ATTATAGGCA 


TGTGCCACCA 


108960 


5 


CACCCACCTA 


ATTTTTGGTA 


TTTTTAGTAG 


AAACTGGGTT 


TCACCACATT 


GGCCAGACTG 


109020 




GTCTTAAACT 


ACTAACCTCA 


GGTGATCTGT 


CCGCCTCAGC 


CTCCCAAAAT 


GCTGGGATTA 


109080 




CAGATGTGAG 


CCACTGTGCC 


CAGCCTCTTT 


TTTTAGTGTA 


TTTAAGGTAA 


TTGATTGAAA 


109140 




GTTTTTGTCT 


AGTCATTCAA 


ATGTCTAGGC 


TTCCTCAGGA 


ACAGTTTCTA 


TTAATTTCTT 


109200 




TATTTTTAAA 


AAATTTTTTT 


TAATTTTCTT 


TTTTTTTTAG 


ATGGAGTCTC 


ACTCTATAGC 


109260 


10 


CTAGGCTGGA 


GTGCAATGGC 


TTGATCTTGG 


CTCACTGCAA 


CCTCTGCCTC 


CTGGGTTCAA 


109320 




GCGATTCTCC 


TGCTTCAGCC 


TCCTGAGTAG 


CTGGGACTAT 


AGGTGCGTGC 


CACCACTCCT 


109380 




GGCTAATTTT 


TTGTATTTTC 


AGTAGAGACA 


TGGTTTTGCC 


GTGTTAGCCA 


GGATGGTCTC 


109440 




GATCTCGTGA 


CCTCATGATC 


CTCCTGCCTC 


GGCCTCCCAA 


AGTGCTGGAA 


TTACAGGTGT 


109500 




GAGCCACCGC 


GCCCAGCCTA 


TTTTTTATTT 


TTTGAGACAA 


AGTCTCCCTC 


TCTCACCCAG 


109560 


15 


GCTGTAGTGC 


AGTGGCACAA 


CCCTGGCACA 


CTGCAGCCTT 


AACCGTCCAG 


GCTTAAGTGA 


109620 




GTCTCCCACC 


TTAGTCTCCT 


GAGTAGCTAG 


AACTACAAGC 


ATGTGCCACC 


ATGCCTGGCT 


109680 




GGTTGTGTTG 


TTACTGTTTT 


AGACACAGGG 


TCTTGCTACA 


TTTCTCTGAC 


TGGTCTTGAA 


109740 




CTCCTGGGCT 


CAAGCAGTCA 


TCCCACCTTG 


GCCTCCCAAG 


GTGTTGAGAT 


TACAGGTGTG 


109800 




AGCCACCGCA 


CCCGGCCTGT 


TAATTTCTTT 


ATTTCCGGTG 


AATGGGCCAC 


ACTTTCTTGT 


109860 


20 


TTCTTTGCAT 


GCCTTGTAAT 


TTTTTGTTGA 


AACCTGCACA 


ATTTGAAGAT 


GATAATGTGG 


109920 




TTACTTTGAA 


AATCAGATCC 


TCCGCCCTCT 


GCAGGGTTCA 


TTGTTGCTGT 


TTGTTGTGGA 


109980 




TTGTCGTTTC 


TCGTTTGTTT 


AGTTACTTTC 


CTGACCTTTT 


TAAATAAAGA 


CTATATTCTG 


110040 




TCAGGGGTGC 


TTGTTTCTGT 


TCTTTTAGGT 


TAGTGGTTAG 


CTTGTGCTTT 


GAAAGAGATT 


110100 




TCTTTAAATA 


TCTAGTGGCA 


AAAAGGATAA 


AGAGGCCGGG 


CGCAGTGGCT 


CACGCCTGTA 


110160 


25 


ATGCTAGGAC 


TTTGGGAAGT 


GGAGGCGGGT 


GGATCACTTG 


AGGTCAGGAG 


TTTAAGATCA 


110220 




GCCTGGCCAG 


TATGGTGAAA 


CCCTGTCTCT 


ACTAAAAATA 


CAAAAATTAA 


CCGGGCATGG 


110280 




TGGCACCTGC 


CTGTAGTCCC 


AGCTACTGGG 


AAGACTGAGG 


CAGGAGAATC 


GCTTCAATCC 


110340 




AGGGGGCGGA 


GGTTGCAGTG 


AGCTGAGATT 


GCGCCATTGC 


ACTCCAGCCT 


GGGCAACAGA 


110400 




GCGAGACTCT 


GTCTCAAATA 


AAAAAAAAAA 


AAAAAGGATA 


AAGAGTGTCT 


TCCATCCTTT 


110460 


30 


CCAGGTTGCC 


TCTGTACTGG 


GGCAAGTCCT 


TCAGTGTCCG 


CCAGGCTGTT 


CACGGCTTTT 


110520 




CCTCAGCCTT 


TACTTCTCGC 


TCCCATGGAG 


CCTAAGGATG 


AACCAGAGGT 


GAAAGTTGAG 


110580 




GGCCTCCTCA 


GGTGTTTCTG 


AGCCCCTGTC 


TAGCCCCAGC 


TGTGTGCATG 


GCCTTCTGGA 


110640 




TTTCCAAGCA 


TGAACAGGAG 


CTTTCCAAAG 


CCCTTAGACC 


TTCATGTAGC 


TCTTTTCCCA 


110700 




GCCTCTTCCT 


TCCTAGGCTT 


TTCTGTCAGC 


TCTTTGCCCA 


TCTGTTGTTG 


TCCCTCCCCC 


110760 
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ACAACTTCAG 


GTAGTATCTA 


CCTGTAAATG 


CCTTCAGGCC 


AGGCGCGGTG 


GCTCATACCT 


110820 




GTTATCCCAG 


CACTTTGGGA 


GGCCGAGGCG 


GGTGAATTGC 


TTGAGGTCAG 


GAGTTCGAGA 


110880 




CCAGCCTGGC 


CAACATGGTG 


AAGCCCCGTC 


TCTAGTAAAA 


ATACAAAAAT 


TAGCTGGGCG 


110940 




TGGTGGGTGC 


CTGTAATCTC 


AGCTACTCGG 


GAGGCTGAAG 


CAGGAGAATT 


GCTTGAGCCT 


111000 


5 


GGGAGGCGGA 


GGTTGCAGTG 


AGCTGAGATC 


GTGCCATTGC 


ACTCCAGCCT 


GGGCGACAGA 


111060 




GTGAGACTCC 


ATCTCGGGGA 


AAAAAAAAAA 


AAAAAAATGC 


CATCAACAGC 


ACGACCCTGG 


111120 




AGGCTGCCCC 


AGCCCTGAGA 


GAGTTCGAGG 


GGGTGAAACA 


AAGGCAAGCC 


CTTCAGGGAG 


111180 




ACACTAGAAA 


GATCCAAATG 


CATAAGCAGG 


ATTCCTTGAG 


AAAAGGTCTG 


TATCATCCCT 


111240 




TCTGACACCA 


GCAAGCCACA 


TCAGAAATAC 


AGGTTGCCTT 


CCCCATGGCT 


ACATGTGAGC 


111300 


10 


TGGTAGTAGT 


GGCTGAGCAG 


AAATAGCCCA 


GCTGTCCTCC 


TGAAATTTAG 


CAGGGTCTTA 


111360 




CTTCATTGAG 


CAGTCATCTG 


GTTCGTAGAC 


ACCAGAGTTA 


CAGAAAAGTT 


TATTGGGAGG 


111420 




TTTTGACAGT 


TTAATAGAAA 


AAAGTTTATT 


GTGACAGTTT 


TGACAGCTGA 


ATAGAAAAAA 


111480 




GTTTACTGTG 


ACAGTTTTGA 


CAGCAGAATA 


GTTGCTTTGC 


TGGAGAGACG 


GATCTTTGGA 


111540 




GCTGCCAACT 


CCATCATTTT 


GGTGATATCC 


AGCTCTGTTG 


CTGAATTTTT 


AGCTATGCTG 


111600 


15 


TTTTAAGTTA 


TTTTCTTAGT 


GGTTGCTCTA 


GAGATGACAA 


TGTGCATCTT 


TAACTTACCA 


111660 




CAATGTACTT 


CAGATTATTA 


CTAACTTAAC 


ACTTAAAGTA 


CAGCATTTTT 


TTTTTTATGG 


111720 




AGTTTCACTC 


TGTCACCCAG 


GCTGGAGTGC 


AATGGTGTGA 


TCTCGGCTCA 


CTGCAACCTC 


111780 




CGCCTCCCAG 


GTTCACGCCA 


TTCTCCTGCC 


TCAGCCTCCT 


GAGTAGCTGG 


GACTACAGGC 


111840 




ACCCCCACCA 


CACCCGGCTA 


ATTTTGTATT 


TTTAGTAGAG 


ATGAGGTTTC 


ACCATGTTGG 


111900 


20 


TCAGGCTGGT 


CTCGAACTGC 


TGACCTCAGG 


TGATCCGCCC 


ATCTTGGCCT 


CCCAAAGTGC 


111960 




TGGGATTACA 


GGTGTGAGCG 


ACTGCACTGA 


GCCTAAGTAT 


GGCAACGTGT 


CTATAACATA 


112020 




GATCTACTTC 


CGTTGTACTA 


TGACATAGTT 


CCCCCTCCAT 


TTTCCTATAG 


CACAGTCCCA 


112080 




ACCTCCCTTT 


TCCTCTGACA 


TAGTTCCATC 


CTCCCTCCTC 


CTATGACGTC 


CTCCCTTCTC 


112140 




CTCTGGCATA 


GCTCCATCCT 


CCCTTCTCCT 


ATGACACAGC 


TCCATCCTCC 


CTTCTCCTCT 


112200 


25 


GACACAGCTC 


CATCCTCCCT 


TCTCCTATGA 


CACAGCTCCA 


TCCTCCCTTC 


TCCTCTGACA 


112260 




TAGCTCCATC 


CTCCCTTCTC 


CTATGTCATA 


GCTCCATCCT 


CCCTTCTCCT 


CTGACACAGC 


112320 




TCCATCCTCC 


CTTCTCCTCT 


GGCATAGCTC 


CATCCTCCCT 


TCTCCTATGA 


CACAGCTCCA 


112380 




TCCTCCCTTC 


TCCTATGACA 


CAGCTCCATC 


CTCCCTTCTC 


CTATGACACA 


GCTCCATCCT 


112440 




CCCTTCTCCT 


ATGACACAGC 


TCCATCCTCC 


CTTCTCCTCT 


GGCATAGCTC 


CATCCTCCCT 


112500 


30 


TCTCCTCTGA 


CATAGCTCCA 


TCCTCCCTTC 


TCCTCTGACA 


TAGCTCCATC 


CTCCCTTCTC 


112560 




CTCTGACATA 


GCTCCATCCT 


CCCTTCTCCT 


CTGACATAGC 


TCCATCCTCC 


CTTCTCCTCT 


112620 




GACATAGCTC 


CATCCTCCCT 


TCTCCTCTGA 


CATAGTTCCA 


TCCTCCCTTG 


TCCTCTGACA 


112680 




TAGCTCCATC 


CTCCCTTCTC 


CTCTGACATA 


GCTCCATCCC 


CTCTTCTCCT 


TCATGTATTA 


112740 




TTGCCATATA 


TACATTTATG 


TATGTTATAA 


CTTCAGCTCT 


TCAGCGTTAT 


AATTATTGCT 


112800 
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TCAAAAGTAT 
TTTTGGGGGC 
CATGACGAAA 
CTGTAGTCCC 
GGAAGTCAAG 
TTATCTCAGA 
GAGGCTGAGG 
TGAAACCCCG 
ATCCCAGCTA 
CAGTGAGCAA 
AAAAAAAAAA 
AATCCCAAAG 
AGCCTGGGCA 
AAAAAGTATA 
TTCTTTCCTG 
ATTTAGAATT 
AATGGTGCAG 
TCAGCCTCCC 
GTATTTTTAG 
TCATGATCTG 
GCCCATTTAG 
AGGCTGGAGT 
CATTCTCCTG 
CTAATTTTTT 
TCTCCTGACC 
GCCACCGCGC 
TATTTGGGAA 
ATTATTAATT 
CATTGTTTCT 
TTTTTCTCTT 



TTTGAAAGAA 
TGAGGTGGGC 
CCCATCTCCA 
AGCTATTTGG 
GCTGCAGTGA 
AGTAAAGGGA 
TGAGTGGATC 
TCTCTACTAA 
CTCGGGAGGC 
GATCGCACCA 
AAAAAGAAAA 
CCTTGGGAGG 
ACATAGCGAG 
CTTGTTTTTT 
TGAATTTGAG 

TCTCAGATCA 
AAGTAGCTGA 
TAGAGACAGG 
CCTGTCCTGG 
AATTTCTTTT 
GCAGTGGCAC 
CCTCAGCCTC 
GTATTTTTAG 
TCGTGATCCG 
CCGGCTAGAA 
TGTCTTTATT 
GATCATTCTT 
GATGAGAAGT 
GATGTTTTCA 



GTTGCCTGGA 
AGATCGCCTG 
CCAAAATTAC 
GGGAGGATCC 
GCTGAGATTG 
CTAGGAATGG 
ACCGGAGGTC 
AAATACAAAA 
TGAGGCAAGA 
CTGCATTACA 
GAAAAGAAAT 
CCAAGGCAGG 
ACTCCATCTC 
TGTATCATCC 
TTACTGTCTA 
TTTGAGACAA 
CTGCAACCTC 
GACTGCAGGC 
GTTTCACTAT 
CCTCCCAAAA 

GATCTCGGCT 
CCGAGTAGCT 
GAGAGATGGG 
CCCGCCTTGG 
TTTCTTGTAG 
TCAGCTTCAT 
TTCAGTGTTT 
CATCTGTCAA 
AGATTTTCTC 



GGCAGTGGCT 
AGCCAGGGAG 
AAAAAATTAG 
CAGCTAAGGT 
TGCCACTGCA 
TGGCTTTTAT 
AGGAGTTTAA 
AGTAGCCGGG 
GAATCGCTTG 
GCCTAGATGA 
TAAGATCTAG 
AGGATCACTT 
TATTTAAAAA 
ATATTTTACC 
GTGTCATTTC 
AGTCTCACTG 
TGCCTCCCTG 
ACCTGCCACC 
GTTGGCCAGG 
TGCTGGGATT 
TTTTGAGATG 
CACTGCGAGC 
GGGATTACAG 
GTTTCACCAT 
CCTCCCAAAG 
GACAGGCTTG 
TTTTTGAAGG 
AAAAGTGTCA 
ATTGTCCCTT 
TTTGTCTTTG 



TATGCCTTTA 
TTGGAGACCA 
TCTGGCATGG 
GGGAGGATCA 
CTCCAGCCTG 
CTCTAATCCC 
GACCAGCCTG 
TGTGGTGGTG 
AACCTGGGAA 
CAGAGCGAGA 
ACACTGTGGT 
GAGGCCAGGA 
AGAAAGAAAT 
TTTCTTTTTT 
CTTTTAGTCT 
TGTTGCCCAG 
GTTAGAGTGA 
ACCCCCAGCC 
CTGGTCTCGA 
ACCATGAGCC 
GGGTCTCGCT 
TCCGCCTCCC 
GCGCCTGCCA 
GTTAGCCAGG 
TGCTGGGATT 
CTAGCAACCA 
ATAGTTTAGC 
TCATGCTACC 
TGTACTTGAA 
GCCTTTAGTA 



ACTCCAGCAC 
GCCTGGGCAA 
TGGCACGCGC 
CTTGAGCCTG 
GGTGCAGATC 
AGCACTTTGG 
GCCAACATGG 
GGTGTCTGTA 
GCGGAGGTTG 
CTCTGCCTAA 
TCATGCCTGT 
GTTCAACACC 
TCAAAGAGAA 
TTGCCCCTTT 
GAAGAACTTC 
GCTGGAGTGC 
TTTTCCTGCC 
AATTTTTTTG 
ATTCATGACC 
icCACGCCCA 
CTTGTTTCCC 
GGGTTCACGC 
CCACGCCCAC 
ATGGTCTTGA 
ACAGGCGTGA 
ATTCAGTGTT 
TGGCTATAGA 
TTCTGGGTTC 
GAATTATCTT 
GTTTGTGATG 



112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
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TATCTAGGTG TGGATCTCTT GGTGTGCATC GTATTTGGGC TTCAGTAAGC CTCTTAGATT 114660 

CATAGATTAA TGTTTTGTTT TGTTTTACCA AATTTGGAGA GTTTTTACTC ATCATTTCAA 114720 

CAAATTTTTT TCCTGCCCCT CTCTCATCTC CTTTTGGGAG TACCACTGCA TGTATGTTGG 114780 

TGTGCGTTCT CTA (SEQ ID NO: 3) 114793. 

5 The present invention also relates to a portion of SEQ ID NO:3 which comprises 5* 

regulatory regions, exons, introns and 3' non-translated regions which comprise the human 
NHL gene of the present invention. Such regulatory sequence may be found within the 
various regions of this 1 15 kb fragment. The 5' portion of SEQ ID NO:l begins at nucleotide 
47095 of SEQ ID NO:3, the initiating ATG of human NHL is from nucleotide 48687-48689 

10 of SEQ ID NO:3, the termination TAG' codon is from nucleotide 84855-84857, while the 3' 
terminus of SEQ ID NO:l as disclosed herein (GCAGTGCCC) corresponds to nucleotides 
85308-85316. To this end, one preferred aspect of the invention is an isolated genomic 
fragment or fragments which comprise from about nucleotide 470000 to about nucleotide 
85500 of SEQ ID NO:3), which comprises the portion of the genomic clone encoding the 

15 mRNA transcript responsible for human NHL (see Figure 5A-B). The genomic sequence 
encoding NHL contains 35 exons (Figure 5 A). An especially preferred aspect of the 
invention is a human genomic fragment or fragments which comprise from about nucleotide 
47095 to about nucleotide 85316 of SEQ ID NO:3. As noted in regard to SEQ ID NO:l, the 
present invention also relates to DNA vectors and recombinant hosts which comprise at least 

20 a portion of SEQ ID NO:3. Portions of the 1 15 kb genomic fragment may be housed in 

multiple vector/hosts so as to optimize handling of the DNA sequences within SEQ ID NO:3. 
Therefore, the present invention relates to the isolated genomic sequence which set forth as 
SEQ ID NO:3, a region of SEQ ID NO:3 which contains the coding and non-coding region oi 
human NHL, as well as ris-acting sequences within SEQ ID NO:3 which effect regulation of 

25 transcription of one or more of the genes localized within this 1 15 kb human genomic 

fragment, including regulatory regions effecting levels of NHL, M68/DcR3, SCLIP and ARP. 
As noted above, this region of chromosome 20 (20ql3.3) is associated with tumor growth. 
Therefore, an aspect of this invention also comprises, as one example, the use of one or more 
regulatory regions of this 115 kb genomic sequence as a target to antagonize the effect of a 

30 transcriptional factor(s) which normally upregulate expression of a gene which has a caustic 
role in tumor growth. Alternatively, compounds may be selected which interacts with a 
specific cw-acting sequence to upregulate a gene within this region, where upregulation 
results in a decrease in tumor growth. 

The present invention is also directed to methods of screening for compounds 
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which modulate the expression of DNA or RNA encoding a NHL protein. 
Compounds which modulate these activities may be DNA, RNA, peptides, proteins, 
or non-proteinaceous organic molecules. Compounds may modulate by increasing or 
attenuating the expression of DNA or RNA encoding NHL, or the function of the 

5 NHL-based protein. Compounds that modulate the expression of DNA or RNA 
encoding NHL or the biological function thereof may be detected by a variety of 
assays. The assay may be a simple "yes/no" assay to determine whether there is a 
change in expression or function. The assay may be made quantitative by comparing 
the expression or function of a test sample with the levels of expression or function in 

10 a standard sample. Kits containing NHL, antibodies to NHL, or modified NHL may 
be prepared by known methods for such uses. 

The DNA molecules, RNA molecules, recombinant protein and antibodies of 
the present invention may be used to screen and measure levels of NHL. The 
recombinant proteins, DNA molecules, RNA molecules and antibodies lend 

15 themselves to the formulation of kits suitable for the detection and typing of NHL. 
Such a kit would comprise a compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier would further comprise reagents such 
as recombinant NHL or anti-NHL antibodies suitable for detecting NHL. The carrier 
may also contain a means for detection such as labeled antigen or enzyme substrates 

20 or the like. 

The assays described above can be carried out with cells that have been 
transiently or stably transfected with NHL. The expression vector may be introduced 
into host cells via any one of a number of techniques including but not limited to 
transformation, transfection, protoplast fusion, and electroporation. Transfection is 

25 meant to include any method known in the art for introducing NHL into the test cells. 
For example, transfection includes calcium phosphate or calcium chloride mediated 
transfection, lipofection, infection with a retroviral construct containing NHL, and 
electroporation. The expression vector-containing cells are individually analyzed to 
determine whether they produce NHL protein. Identification of NHL expressing cells 

30 may be done by several means, including but not limited to immunological reactivity 
with anti-NHL antibodies, labeled ligand binding, the presence of host cell-associated 
NHL activity. 

The specificity of binding of compounds showing affinity for NHL is shown 
by measuring the affinity of the compounds for recombinant cells expressing NHL. 
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Expression of human NHL and screening for compounds that bind to NHL or that 
inhibit the binding of a known, radiolabeled ligand of NHL provides an effective 
method for the rapid selection of compounds with high affinity for NHL. Such 
ligands need not necessarily be radiolabeled but can also be nonisotopic compounds 

5 that can be used to displace bound radiolabeled compounds or that can be used as 
activators in functional assays. Compounds identified by the above method are likely 
to be agonists or antagonists of NHL and may be peptides, proteins, or non- 
proteinaceous organic molecules. 

Accordingly, the present invention is directed to methods for screening for 

10 compounds which modulate the expression of DNA or RNA encoding a NHL protein 
as well as compounds which effect the function of the NHL protein. Methods for 
identifying agonists and antagonists of other receptors are well known in the art and 
can be adapted to identify agonists and antagonists of NHL. For example, 
Cascieri et al. (1992, Molec. Pharmacol. 41:1096-1099) describe a method for 

15 identifying substances that inhibit agonist binding to rat neurokinin receptors and thus 
are potential agonists or antagonists of neurokinin receptors. The method involves 
transfecting COS cells with expression vectors containing rat neurokinin receptors, 
allowing the transfected cells to grow for a time sufficient to allow the neurokinin 
receptors to be expressed, harvesting the transfected cells and resuspending the cells 

20 in assay buffer containing a known radioacti vely labeled agonist of the neurokinin 
receptors either in the presence or the absence of the substance, and then measuring 
the binding of the radioactively labeled known agonist of the neurokinin receptor to 
the neurokinin receptor. If the amount of binding of the known agonist is less in the 
presence of the substance than in the absence of the substance, then the substance is a 

25 potential agonist or antagonist of the neurokinin receptor. Where binding of the 
substance such as an agonist or antagonist to is measured, such binding can be 
measured by employing a labeled substance or agonist. The substance or agonist can 
be labeled in any convenient manner known to the art, e.g., radioactively, 
fluorescently, enzymatically. 

30 Therefore, the present invention includes assays by which modulators of NHL 

are identified. As noted above, methods for identifying agonists and antagonists are 
known in the art and can be adapted to identify compounds which effect in vivo levels 
of NHL. Accordingly, the present invention includes a method for determining 
whether a substance is a potential modulator of mammalian NHL levels that 
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comprises: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of NHL in the cells; 

(b) exposing the test cells to the substance; 

5 (c) measuring the amount of binding of the substance to NHL; 

(d) comparing the amount of binding of the substance to NHL in 
the test cells with the amount of binding of the substance to control cells that have not 
been transfected with NHL or a portion thereof; wherein if the amount of binding of 
the substance is greater in the test cells as compared to the control cells, the substance 

10 is capable of binding to NHL. 

The conditions under which step (b) of the method is practiced are conditions 
that are typically used in the art for the study of protein-ligand interactions: e.g., 
physiological pH; salt conditions such as those represented by such commonly used 
buffers as PBS or in tissue culture media; a temperature of about 4°C to about 55°C. 

15 The assays described above can be carried out with cells that have been 

transiently or stably transfected with NHL. Transfection is meant to include any 
method known in the art for introducing NHL into the test cells. For example, 
transfection includes calcium phosphate or calcium chloride mediated transfection, 
lipofection, infection with a retroviral construct containing NHL, and electroporation. 

20 Where binding of the substance or agonist to NHL is measured, such binding 

can be measured by employing a labeled substance or agonist. The substance or 
agonist can be labeled in any convenient manner known to the art, e.g., radioactively, 
fluorescently, enzymatically. 

Therefore, the specificity of binding of compounds having affinity for NHL 

25 shown by measuring the affinity of the compounds for recombinant cells expressing 
the cloned receptor or for membranes from these cells. Expression of the cloned 
receptor and screening for compounds that bind to NHL or that inhibit the binding of a 
known, radiolabeled ligand of NHL to these cells provides an effective method for the 
rapid selection of compounds with high affinity for NHL. Such ligands need not 

30 necessarily be radiolabeled but can also be nonisotopic compounds that can be used to 
displace bound radiolabeled compounds or that can be used as activators in functional 
assays. It is also possible to construct assays wherein compounds are tested for an 
ability to modulate helicase activity in an in vitro- or in vivo- based assay. 
Compounds identified by the above method again are likely to be agonists or 
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antagonists of NHL and may be peptides, proteins, or non-proteinaceous organic 
molecules. As noted elsewhere in this specification, compounds may modulate by 
increasing or attenuating the expression of DNA or RNA encoding NHL, or by acting 
as an agonist or antagonist of the NHL receptor protein. Again, these compounds that 

5 modulate the expression of DNA or RNA encoding NHL or the biological function 
thereof may be detected by a variety of assays. The assay may be a simple "yes/no" 
assay to determine whether there is a change in expression or function. The assay may 
be made quantitative by comparing the expression or function of a test sample with 
the levels of expression or function in a standard sample. 

10 Expression of NHL DNA may also be performed using in vitro produced 

synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free 
systems, including but not limited to wheat germ extracts and reticulocyte extracts, as 
well as efficiently translated in cell based systems, including but not limited to 
microinjection into frog oocytes, with microinjection into frog oocytes being 

15 preferred. 

Following expression of NHL in a host cell, NHL protein may be recovered to 
provide NHL protein in active form. Several NHL protein purification procedures are 
available and suitable for use. Recombinant NHL protein may be purified from cell 
lysates and extracts by various combinations of, or individual application of salt 

20 fractionation, ion exchange chromatography, size exclusion chromatography, 
hydroxylapatite adsorption chromatography and hydrophobic interaction 
chromatography. In addition, recombinant NHL protein can be separated from other 
cellular proteins by use of an immunoaffinity column made with monoclonal or 
polyclonal antibodies specific for full-length NHL protein, or polypeptide fragments 

25 of NHL protein. 

Polyclonal or monoclonal antibodies may be raised against NHL or a synthetic 
peptide (usually from about 9 to about 25 amino acids in length) from a portion of 
NHL disclosed in SEQ ID NO:2. Monospecific antibodies to NHL are purified from 
mammalian antisera containing antibodies reactive against NHL or are prepared as 

30 monoclonal antibodies reactive with NHL using the technique of Kohler and Milstein 
(1975, Nature 256: 495-497). Monospecific antibody as used herein is defined as a 
single antibody species or multiple antibody species with homogenous binding 
characteristics for NHL. Homogenous binding as used herein refers to the ability of 
the antibody species to bind to a specific antigen or epitope, such as those associated 
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with NHL, as described above. Human NHL-specific antibodies are raised by 
immunizing animals such as mice, rats, guinea pigs, rabbits, goats, horses and the 
like, with an appropriate concentration of NHL protein or a synthetic peptide 
generated from a portion of NHL with or without an immune adjuvant. 

5 Preimmune serum is collected prior to the first immunization. Each animal 

receives between about 0.1 mg and about 1000 mg of NHL protein associated with an 
acceptable immune adjuvant. Such acceptable adjuvants include, but are not limited 
to, Freund's complete, Freund's incomplete, alum-precipitate, water in oil emulsion 
containing Corynebacteriwn parvum and tRNA. The initial immunization consists of 

10 NHL protein or peptide fragment thereof in, preferably, Freund's complete adjuvant at 
multiple sites either subcutaneously (SC), intraperitoneal ly (IP) or both. Each animal 
is bled at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial immunization. 
Those animals receiving booster injections are generally given an equal amount of 

15 NHL in Freund's incomplete adjuvant by the same route. Booster injections are given 
at about three week intervals until maximal titers are obtained. At about 7 days after 
each booster immunization or about weekly after a single immunization, the animals 
are bled, the serum collected, and aliquots are stored at about -20°C. 

Monoclonal antibodies (mAb) reactive with NHL are prepared by immunizing 

20 inbred mice, preferably Balb/c, with NHL protein. The mice are immunized by the IP 
or SC route with about 1 mg to about 100 mg, preferably about 10 mg, of NHL 
protein in about 0.5 ml buffer or saline incorporated in an equal volume of an 
acceptable adjuvant, as discussed above. Freund's complete adjuvant is preferred. 
The mice receive an initial immunization on day 0 and are rested for about 3 to about 

25 30 weeks. Immunized mice are given one or more booster immunizations of about 1 
to about 100 mg of NHL in a buffer solution such as phosphate buffered saline by the 
intravenous (TV) route. Lymphocytes, from antibody positive mice, preferably splenic 
lymphocytes, are obtained by removing spleens from immunized mice by standard 
procedures known in the art. Hybridoma cells are produced by mixing the splenic 

30 lymphocytes with an appropriate fusion partner, preferably myeloma cells, under 

conditions which will allow the formation of stable hybridomas. Fusion partners may 
include, but are not limited to: mouse myelomas P3/NSl/Ag 4-1; MPC-1 1; S-194 and 
Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and myeloma cells 
are fused in polyethylene glycol, about 1000 mol. wt., at concentrations from about 
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30% to about 50%. Fused hybridoma cells are selected by growth in hypoxanthine, 
thymidine and aminopterin supplemented Dulbecco's Modified Eagles Medium 
(DMEM) by procedures known in the art. Supernatant fluids are collected form 
growth positive wells on about days 14, 18, and 21 and are screened for antibody 

5 production by an immunoassay such as solid phase immunoradioassay (SPIRA) using 
NHL as the antigen. The culture fluids are also tested in the Ouchterlony precipitation 
assay to determine the isotype of the mAb. Hybridoma cells from antibody positive 
wells are cloned by a technique such as the soft agar technique of MacPherson, 1973, 
Soft Agar Techniques, in Tissue Culture Methods and Applications, Kruse and 

10 Paterson, Eds., Academic Press. 

Monoclonal antibodies are produced in vivo by injection of pristine primed 
Balb/c mice, approximately 0.5 ml per mouse, with about 2 x 10 6 to about 6 x 10 6 
hybridoma cells about 4 days after priming. Ascites fluid is collected at 
approximately 8-12 days after cell transfer and the monoclonal antibodies are purified 

15 by techniques known in the art. 

In vitro production of anti- NHL mAb is carried out by growing the hybridoma 
in DMEM containing about 2% fetal calf serum to obtain sufficient quantities of the 
specific mAb. The mAb are purified by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are determined by 

20 various serological or immunological assays which include, but are not limited to, 
precipitation, passive agglutination, enzyme-linked immunosorbent antibody (ELISA) 
technique and radioimmunoassay (RIA) techniques. Similar assays are used to detect 
the presence of NHL in body fluids or tissue and cell extracts. 

It is readily apparent to those skilled in the art that the above described 

25 methods for producing monospecific antibodies may be utilized to produce antibodies 
specific for NHL peptide fragments, or a respective full-length NHL. 

NHL antibody affinity columns are made, for example, by adding the 
antibodies to Affigel-10 (Biorad), a gel support which is pre-activated with N- 
hydroxysuccinimide esters such that the antibodies form covalent linkages with the 

30 agarose gel bead support. The antibodies are then coupled to the gel via amide bonds 
with the spacer arm. The remaining activated esters are then quenched with 1M 
ethanolamine HC1 (pH 8). The column is washed with water followed by 0.23 M 
glycine HC1 (pH 2.6) to remove any non-conjugated antibody or extraneous protein. 
The column is then equilibrated in phosphate buffered saline (pH 7.3) and the cell 
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culture supernatants or cell extracts containing full-length NHL or NHL protein 
fragments are slowly passed through the column. The column is then washed with 
phosphate buffered saline until the optical density (A280) falls to background, then the 
protein is eluted with 0.23 M glycine-HCl (pH 2.6). The purified NHL protein is then 

5 dialyzed against phosphate buffered saline. 

Pharmaceutical ly useful compositions comprising modulators of NHL may be 
formulated according to known methods such as by the admixture of a 
pharmaceutically acceptable carrier. Examples of such carriers and methods of 
formulation may be found in Remington's Pharmaceutical Sciences. To form a 

10 pharmaceutically acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the protein, DNA, RNA, modified 
NHL, or either NHL agonists or antagonists including tyrosine kinase activators or 
inhibitors. 

Therapeutic or diagnostic compositions of the invention are administered to an 

15 individual in amounts sufficient to treat or diagnose disorders. The effective amount 
may vary according to a variety of factors such as the individual's condition, weight, 
sex and age. Other factors include the mode of administration. 

The pharmaceutical compositions may be provided to the individual by a 
variety of routes such as subcutaneous, topical, oral and intramuscular. 

20 The term "chemical derivative" describes a molecule that contains additional 

chemical moieties which are not normally a part of the base molecule. Such moieties 
may improve the solubility, half-life, absorption, etc. of the base molecule. 
Alternatively the moieties may attenuate undesirable side effects of the base molecule 
or decrease the toxicity of the base molecule. Examples of such moieties are 

25 described in a variety of texts, such as Remington's Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed herein may be used 
alone at appropriate dosages. Alternatively, co-administration or sequential 
administration of other agents may be desirable. 

The present invention also has the objective of providing suitable topical, oral, 

30 systemic and parenteral pharmaceutical formulations for use in the novel methods of 
treatment of the present invention. The compositions containing compounds 
identified according to this invention as the active ingredient can be administered in a 
wide variety of therapeutic dosage forms in conventional vehicles for administration. 
For example, the compounds can be administered in such oral dosage forms as tablets, 
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capsules (each including timed release and sustained release formulations), pills, 
powders, granules, elixirs, tinctures, solutions, suspensions, syrups and emulsions, or 
by injection. Likewise, they may also be administered in intravenous (both bolus and 
infusion), intraperitoneal, subcutaneous, topical with or without occlusion, or 

5 intramuscular form, all using forms well known to those of ordinary skill in the 
pharmaceutical arts. 

Advantageously, compounds of the present invention may be administered in a 
single daily dose, or the total daily dosage may be administered in divided doses of 
two, three or four times daily. Furthermore, compounds for the present invention can 

10 be administered in intranasal form via topical use of suitable intranasal vehicles, or 
via transdermal routes, using those forms of transdermal skin patches well known to 
those of ordinary skill in that art. To be administered in the form of a transdermal 
delivery system, the dosage administration will, of course, be continuous rather than 
intermittent throughout the dosage regimen. 

15 For combination treatment with more than one active agent, where the active 

agents are in separate dosage formulations, the active agents can be administered 
concurrently, or they each can be administered at separately staggered times. 

The dosage regimen utilizing the compounds of the present invention is 
selected in accordance with a variety of factors including type, species, age, weight, 

20 sex and medical condition of the patient; the severity of the condition to be treated; 
the route of administration; the renal, hepatic and cardiovascular function of the 
patient; and the particular compound thereof employed. A physician or veterinarian 
of ordinary skill can readily determine and prescribe the effective amount of the drug 
required to prevent, counter or arrest the progress of the condition. Optimal precision 

25 in achieving concentrations of drug within the range that yields efficacy without 
toxicity requires a regimen based on the kinetics of the drug's availability to target 
sites. This involves a consideration of the distribution, equilibrium, and elimination 
of a drug. 

The present invention also relates to a non-human transgenic animal which is 
30 useful for studying the ability of a variety of compounds to act as modulators of NHL, 
or any alternative functional NHL in vivo by providing cells for culture, in vitro. In 
reference to the transgenic animals of this invention, reference is made to transgenes 
and genes. As used herein, a transgene is a genetic construct including a gene. The 
transgene is integrated into one or more chromosomes in the cells in an animal by 
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methods known in the art. Once integrated, the transgene is carried in at least one 
place in the chromosomes of a transgenic animal. Of course, a gene is a nucleotide 
sequence that encodes a protein, such as one or a combination of the cDNA clones 
described herein. The gene and/or transgene may also include genetic regulatory 

5 elements and/or structural elements known in the art. A type of target cell for 

transgene introduction is the embryonic stem cell (ES). ES cells can be obtained from 
pre-implantation embryos cultured in vitro and fused with embryos (Evans et al., 
1981, Nature 292:154-156; Bradley et al., 1984, Nature 309:255-258; Gossler et al., 
1986, Proc. Natl. Acad. ScL USA 83:9065-9069; and Robertson et al., 1986 Nature 

10 322:445-448). Transgenes can be efficiently introduced into the ES cells by a variety 
of standard techniques such as DNA transfection, microinjection, or by retrovirus- 
mediated transduction. The resultant transformed ES cells can thereafter be combined 
with blastocysts from a non-human animal. The introduced ES cells thereafter 
colonize the embryo and contribute to the germ line of the resulting chimeric animal 

15 (Jaenisch, 1988, Science 240: 1468-1474). It will also be within the purview of the 
skilled artisan to produce transgenic or knock-out invertebrate animals 
(e.g., C. elegans) which express the NHL transgene in a wild type background as well 
in C. elegans mutants knocked out for one or both of the NHL subunits. These 
organisms will be helpful in further determining the dominant negative effect of NHL 

20 as well as selecting from compounds which modulate this effect. 

The present invention also relates to a non-human transgenic animal which is 
heterozygous for a functional NHL gene native to that animal. As used herein, 
functional is used to describe a gene or protein that, when present in a cell or in vitro 
system, performs normally as if in a native or unaltered condition or environment. 

25 The animal of this aspect of the invention is useful for the study of the retinal specific 
expression or activity of NHL in an animal having only one functional copy of the 
gene. The animal is also useful for studying the ability of a variety of compounds to 
act as modulators of NHL activity or expression in vivo or, by providing cells for 
culture, in vitro. It is reiterated that as used herein, a modulator is a compound that 

30 causes a change in the expression or activity of NHL, or causes a change in the effect 
of the interaction of NHL with its ligand(s), or other protein(s). In an embodiment of 
this aspect, the animal is used in a method for the preparation of a further animal 
which lacks a functional native NHL gene. In another embodiment, the animal of this 
aspect is used in a method to prepare an animal which expresses a non-native NHL 
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gene in the absence of the expression of a native NHL gene. In particular 
embodiments the non-human animal is a mouse. In further embodiments the non- 
native NHL is a wild-type human NHL which is disclosed herein, or any other 
biologically equivalent form of human NHL gene as also disclosed herein. 

5 In reference to the transgenic animals of this invention, reference is made to 

transgenes and genes. As used herein, a transgene is a genetic construct including a 
gene. The transgene is integrated into one or more chromosomes in the cells in an 
animal by methods known in the art. Once integrated, the transgene is carried in at 
least one place in the chromosomes of a transgenic animal. Of course, a gene is a 

10 nucleotide sequence that encodes a protein, such as human or mouse NHL. The gene 
and/or transgene may also include genetic regulatory elements and/or structural 
elements known in the art. 

Another aspect of the invention is a non-human animal embryo deficient for 
native NHL expression, This embryo is useful in studying the effects of the lack of 

15 NHL on the developing animal. In particular embodiments the animal is a mouse. 
The animal embryo is also useful as a source of cells lacking a functional native NHL 
gene. The cells are useful in in vitro culture studies in the absence of NHL. 

An aspect of this invention is a method to obtain an animal in which the cells 
lack a functional gene NHL native to the animal. The method includes providing a 

20 gene for an altered form of the NHL gene native to the animal in the form of a 

transgene and targeting the transgene into a chromosome of the animal at the place of 
the native NHL gene. The transgene can be introduced into the embryonic stem cells 
by a variety of methods known in the art, including electroporation, microinjection, 
and lipofection. Cells carrying the transgene can then be injected into blastocysts 

25 which are then implanted into pseudopregnant animals. In alternate embodiments, the 
transgene-targeted embryonic stem cells can be coincubated with fertilized eggs or 
morulae followed by implantation into females. After gestation, the animals obtained 
are chimeric founder transgenic animals. The founder animals can be used in further 
embodiments to cross with wild-type animals to produce Fl animals heterozygous for 

30 the altered NHL gene. In further embodiments, these heterozygous animals can be 
interbred to obtain the non-viable transgenic embryos whose somatic and germ cells 
are homozygous for the altered NHL gene and thereby lack a functional NHL gene. In 
other embodiments, the heterozygous animals can be used to produce cells lines. In 
preferred embodiments, the animals are mice. 
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A further aspect of the present invention is a transgenic non-human animal 
which expresses a non-native NHL on a native NHL null background. In particular 
embodiments, the null background is generated by producing an animal with an 
altered native NHL gene that is non-functional, i.e. a knockout. The animal can be 

5 heterozygous (i.e., having a different allelic representation of a gene on each of a pair 
of chromosomes of a diploid genome) or homozygous (i.e., having the same 
representation of a gene on each of a pair of chromosomes of a diploid genome) for 
the altered NHL gene and can be hemizygous (i.e., having a gene represented on only 
one of a pair of chromosomes of a diploid genome) or homozygous for the non-native 

10 NHL gene. In preferred embodiments, the animal is a mouse. In particular 

embodiments the non-native NHL gene can be a wild-type or mutant allele including 
those mutant alleles associated with a disease. In further embodiments, the non-native 
NHL is a human NHL. In a further embodiment the non-native NHL gene is operably 
linked to a promoter. As used herein, operably linked is used to denote a functional 

15 connection between two elements whose orientation relevant to one another can vary. 
In this particular case, it is understood in the art that a promoter can be operably 
linked to the coding sequence of a gene to direct the expression of the coding 
sequence while placed at various distances from the coding sequence in a genetic 
construct. 

20 An aspect of this invention is a method of producing transgenic animals 

having a transgene including a non-native NHL gene on a native NHL null 
background. The method includes providing transgenic animals of this invention 
whose cells are heterozygous for a native gene encoding a functional NHL protein and 
an altered native NHL gene. These animals are crossed with transgenic animals of 

25 this invention that are hemizygous for a transgene including a non-native NHL gene to 
obtain animals that are both heterozygous for an altered native NHL gene and 
hemizygous for a non-native NHL gene. The latter animals are interbred to obtain 
animals that are homozygous or hemizygous for the non-native NHL and are 
homozygous for the altered native NHL gene. In particular embodiments, cell lines 

30 are produced from any of the animals produced in the steps of the method. 

The transgenic animals and cells of this invention are useful in the 
determination of the in vivo function of a non-native NHL in the central nervous 
. system and in other tissues of an animal. The animals are also useful in studying the 
tissue and temporal specific expression patterns of a non-native NHL throughout the 
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animals. The animals are also useful in determining the ability for various forms of 
wild-type and mutant alleles of a non-native NHL to rescue the native NHL null 
deficiency. The animals are also useful for identifying and studying the ability of a 
variety of compounds to act as modulators of the expression or activity of a non-native 

5 NHL in vivo, or by providing cells for culture, for in vitro studies. 

As used herein, a "targeted gene" or "Knockout" (KO) is a DNA sequence 
introduced into the germline of a non-human animal by way of human intervention, 
including but not limited to, the methods described herein. The targeted genes of the 
invention include nucleic acid sequences which are designed to specifically alter 

10 cognate endogenous alleles. An altered NHL gene should not fully encode the same 
NHL as native to the host animal, and its expression product can be altered to a minor 
or great degree, or absent altogether. In cases where it is useful to express a non- 
native NHL gene in a transgenic animal in the absence of a native NHL gene we 
prefer that the altered NHL gene induce a null lethal knockout phenotype in the 

15 animal. However a more modestly modified NHL gene can also be useful and is 
within the scope of the present invention. 

A type of target cell for transgene introduction is the embryonic stem cell (ES). 
ES cells can be obtained from pre-implantation embryos cultured in vitro and fused 
with embryos (Evans et al., 1981, Nature 292:154-156; Bradley et al., 1984, Nature 

20 309:255-258; Gossler et al., 1986, Proc. Natl Acad. Sci. USA 83:9065-9069; and 

Robertson et al., 1986 Nature 322:445-448). Transgenes can be efficiently introduced 
into the ES cells by a variety of standard techniques such as DNA transfection, 
microinjection, or by retrovirus-mediated transduction. The resultant transformed ES 
cells can thereafter be combined with blastocysts from a non-human animal. The 

25 introduced ES cells thereafter colonize the embryo and contribute to the germ line of 
the resulting chimeric animal (Jaenisch, 1988, Science 240: 1468-1474). 

The methods for evaluating the targeted recombination events as well as the 
resulting knockout mice are readily available and known in the art. Such methods 
include, but are not limited to DNA (Southern) hybridization to detect the targeted 

30 allele, polymerase chain reaction (PCR), polyacrylamide gel electrophoresis (PAGE) 
and Western blots to detect DNA, RNA and protein. 

The following examples are provided to illustrate the present invention 
without, however, limiting the same hereto. 
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EXAMPLE 1 
Characterization of DNA Molecules Encoding NHL 
M68/DcR3 identification - The human osteoprotegerin (OPG) sequence (Acc. 

5 # U94332), which is a member of the TNFR-related family, was used to searched 
Genbank using the programs TBLASTN and TFASTX3 to identify novel gene family 
members. Two EST sequences (GenBank Acc. # A Al 55701 and AA025672) were 
identified that showed sequence similarities to the cysteine repeats of the OPG 
sequence. These EST sequences were then used to identify additional EST sequences, 

10 which formed a single EST cluster (GenBank Acc. #s aa577603, aa603704, aa613366, 
aal58406, w67560, aa325843, aal55646, aa025673, aa514270, m91489). Two clones 
were further characterized, which were derived from colon tumor and germ cell tumor 
libraries (Research Genetics, Inc). DNA sequence analysis revealed two alternatively 
spliced forms of the 5'-end UTR of M68/DcR3. The M68/DcR3 open reading frame 

15 was confirmed by sequence analysis of clones obtained by PCR cloning from a normal 
human cDNA library (Clontech). 

M68/DcR3 BAC identification and sequencing - To further delineate the gene 
structure of M68/DcR3, genomic DNA was obtained using a human "Down to the 
Well" ™ genomic bacterial artificial chromosome (BAC) library (Genome Systems, 

20 Inc.) according to the manufacturer's protocol. Two sets of PCR primers, C68.36F: 
S'-CACAGGTTCAGCATGTTTGTGCGTC-S' (SEQ ID NO:4) and C68.275R: 
5 , -CACAGTCCCTGCTGGCCTCTGTCTA-3 , (SEQ ID NO:5), and E68.715F: 
5'-CAGGACATCTCCATCAAGAGGCTGC-3' (SEQ ID NO:6) and E68.972R: 
5 * - A AT A AG AGGGGGCC AGG ATC AGTGC-3' (SEQ ID NO:7), were used to carry 

25 out PCR reactions to identify positive wells that contained the full-length M68/DcR3 
gene. The PCR conditions used were 94°C for 9min, 35 cycles of (94°C, 30 sec., 68°C 
3 min.) followed by 72°C for 10 min. Two positive BAC clones were identified and 
characterized by restriction digestion and BAC-end sequence analyses, of which 
hbml68 was selected for shotgun sequencing. 

30 A shot-gun library for BAC hbml68 was constructed using a conventional 

strategy. Briefly, two 150-ml bacterial cultures were combined and purified using a 
modified protocol of the plasmid-Maxi kit (QIAGEN) followed by CsCl gradient 
purification. After butanol extraction and isopropanol precipitation, BAC DNA was 
nebulizied at 10 psi for 60 seconds to generate randomly sheared fragments. 
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Following ethanol precipitation, the fragments were end-repaired using T4 polymerase 
(Promega) and BstXI adaptors (Invitrogen) were Iigated overnight. Removal of 
excess, unligated adaptors and size selection was performed using a cDNA sizing 
column (Life Technologies, Inc.) to generate genomic fragments in the size range of 

5 1500 to 3000 bp. Adaptor Iigated fragments were cloned into a modified pBlueScript 
SK + vector (Stratagene) and transformed in XL2-Blue ultracompentent cells 
(Stratagene). Approximately 1000 clones were isolated, plasmids were purified using 
the Turbo miniprep kits (QIAGEN), and both plasmid ends were sequenced with the 
BigDye terminator kits (Perkin-Elmer). Sequence data were assembled using 

10 Phred/Phrap/Consed where single-stranded and gap regions were closed using a 
directed sequencing strategy. 

NHL identification and sequencing - The genomic clone for the NHL gene 
was obtained and sequenced. The transcript was identified through exon prediction 
using GRAIL2 and sequence alignment to a contiguous 4.5 kilobase region of 

15 chromosome 4 (88% sequence identity). The complete exon structure of NHL was 
subsequently confirmed by RT-PCR analysis. The exon structure was confirmed by 
RT-PCR using polyA RNA from a human colorectal adenocarcinoma cell line, 
SW480 (Clontech). Primers were designed based on the genomic sequence that were 
predicted to be exons. RT-PCR reaction were carried out with SW480 polyA RNA 

20 using standard conditions with TaqGoId Enzyme at 94°C for 12min, 35 cycles of 
(94°C, 30 sec, 60C, 30 sec, and 68°C 2-6 min.) followed by 68°C for 7 min. Most 
sequence confrimation was accomplished by RT-PCR, although first junction between 
exon 1 and 2 was confirmed by 5'RACE and junctions between exon 26-29 were by 
RCCA. The primers used were as follows: 



Junction of Exons 


Confirmed bv Primers 


H01/H02 


hdkw (5'RACE) 


H02/H03 


hdiy,hdiz 


H03-H09 


hdid,hdie,hdja,hdjb 


H09-H13 


hdja,hdie 


H13-H18 


hdje,hdjf 


H18-H23 


hdjg.hdjh 


H23-H26 


hdji.hdjj 


H26-H29 


hdkv,r543(RCCA) 


H29-H31 


hdij,hdmu,hdnd,hdne 
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H31/H32 hdij.hdmu 

H32/H34 hdip,hdil,hdmv,hdik,hdli 

H34/H35 hdng,hdnh 

5 HDID - 5 -GTG A ATGGC ATCCTG G AG AG-3 ' (SEQ ID N0:8); 

HDE - 5 -GTCTCC AGGC AGCTC AAC AG-3 ' (SEQ ID N0:9); 

HDD - 5 -ACCCTGTCCCTCCTGTCTGA-3' (SEQ ID NO: 10); 

HDIY - 5 -AGACCCTAAGATGTTCGGAG-3' (SEQ ID NO: 1 1); 

HDE - 5 -G ATG ACCTGTGTG AGTTGCG-3 ' (SEQ ID NO: 12); 
10 HDJA - 5-CGCAACTCACACAGGTCATC-3'(SEQ ID NO:13); 

HDJB - 5-GGAGTCAGGTCAAAGGATGC-3' (SEQ ID NO: 14); 

HDJC - 5-GCATCCTTTGACCTGACTCC-3'(SEQ ID NO: 15); 

HDJD - 5-GGTCTGAAACGTGATCTGGG-3' (SEQ ID NO: 16); 

HDJE - 5'-CCCAGATCACGTTTCAGACC-3' (SEQ ID NO: 17); 
15 HDJF - 5 -CGATG ATGTGTGGGTTCTCC-3 ' (SEQ ID NO:18); 

HDJG - 5 -GGAGAACCCACACATCATCG-3' (SEQ ID NO: 19); 

HDJH - 5-CGTGTCTGAGAAGTCCAGCC-3' (SEQ ID NO:20); 

HDJI - 5 -GGCTGGACTTCTCAGACACG-3' (SEQ ID NO:21); 

HDJJ - 5 -ACAGCATCTTCTCCACGCAC-3'(SEQ ID NO:22); 
20 HFMU - 5 '- AGTCCTCTGGCTTTGC AGTG-3 ' (SEQ ID NO:23); 

HDKV - 5 -TGTGCGTGGAGAAGATGCTG-3' (SEQ ID NO:24); 

HDKW - 5 -GGCTGG AAAGGG AAGTCTAC-3 ' (SEQ ID NO:25); 

HDND - 5-TGGTTCAGGTGCTCTTGGGG-3' (SEQ ID NO:26); 

HDNE - 5 -CGTGAAGCAGGAGTTGAGCC-3' (SEQ ID NO:27); 
25 HDIK - 5-ATCTTGCTCTGGGTCTTCCC-3' (SEQ ID NO:28); 

HDIL - 5 -CACTGCAAAGCCAGAGGACT-3' (SEQ ID NO:29); 

HDIP - 5 -ATAAGCAAGACGACGACCTC-3' (SEQ ID NO:30); 

HDU - 5-CTATTCTGTTGGGTGGGTTC-3' (SEQ ID NO:31); 

HDMV - S'-CGTGCCTCCTGTGCTTACCC-S' (SEQ ID NO:32); 
30 HDNG - 5 -CAG ACCCC A AGGTAGCTC AG-3 ' (SEQ ID NO:33); 

HDNH - 5 -GGAAGACCC AGAGC AAGATC-3' (SEQ ID NO:34). 
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Amplified product were subject to direct sequencing after purification from an agarose 
gel or cloned into a TOPO PCR cloning vector (Invitrogen) for sequencing. 
Multiple sequence alignment of NHL to known helicases showed that NHL contains 

5 all the seven critical helicase domains. BLAST analysis of the predicted 1,219 amino 
acid sequence (see Figure 2, SEQ ID NO:2) reveal an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 
helicases (see Figure 3). Review of this sequence data shows that two partial human 
cDNA clones (Acc. No. al080127 and ab029011) are deposited. No. al080127 covers 

10 exon 25-35 while ab02901 1 covers exons 9-35. Ab02901 1 starts at amino acid 240 
of the full length human NHL protein disclosed herein, but also differs at exon 35 and 
appears to be a fusion transcript with M68. This cDNA was isolated from brain 
tissue, which has been known to express rare transcripts. 

15 EXAMPLE 2 

Northern Analysis of human NHL Expression 
Messenger RNA (mRNA) obtained from human brain, heart, skeletal muscle, 
colon, thymus, spleen, kidney, liver, small intestine, placenta, lung, and peripheral 
blood leukocytes. Two fig of polyA + RNA were run on each lane a denaturing 
20 formaldehyde 1% agarose gel, and transferred to a charged-modified nylon membrane. 
The probe was made using a 733 bp fragment derived from 1 174-1907 nt of the NHL 

32 

cDNA. This fragment was labeled via the P dCTP random priming method 
(Ambion). Hybridization was carried in ExpressHyb (Clontech) according to the 
manufacturer's protocol except for the final wash, which was at 55°C. Membranes 
25 were exposed to X-ray film with intensifying screen at -80°C overnight. The Northern 
data is presented in Figure 4. Note hybridization of the NHL probe to an 
approximately 4.4 kb transcript. The 7.5 kb transcript may suggest an alternative 
splicing of the NHL RNA. 
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EXAMPLE 3 
Chromosomal localization 
To map the position of M68/NHL in the human genome, primers C68.36F and 

5 C68.275R, were used to carry out PCR reactions to 93 clones of the MIT GeneBridge 
4 panel (Research Genetics) and results were submitted to MIT for analysis. 
M68/DcR3 was mapped to the extreme telomere of chromosome 20, at 20ql3.3, 28cR 
from D20S173 with a lod score of 13. An analogous procedure was also carried out 
with the 83 clones of the Stanford G3 radiation hybrid panel, with PCR results 

10 submitted to the Stanford Genome Center for analysis. Analysis using another pair of 
PCR primers specific to NHL yielded the same result. For fluorescence in situ (FISH) 
analysis, the normal human male fibroblast cell line, L136 (Coriell Cell Repository, 
Camden, NJ) was arrested in mitosis with colcemid (10 /ig/ml). A human 
chromosome 20 a-satellite probe (Vysis, Downers Grove, IL) was directly labeled 

15 with Spectrum Orange dUTP and was used to identify chromosome 20. The M68 
BAC clone was directly labeled with SpectrumGreen dUTP by nick translation 
(Vysis). Slides were counterstained with DAPI stain and viewed under an Olympus 
microscope with narrow blue and DAPI/TRITC filters. Fifty metaphase cells were 
scored to verify that the M68 probe was located on the same chromosome as the 

20 Human Chromosome 20 probe. Radiation hybrid chromosomal mapping reconfirms 
that it is linked to M68 locus,at 20ql3.3. 
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WHAT IS CLAIMED IS: 

1. A purified DNA molecule encoding a mammalian NHL protein. 



2. A purified DNA molecule of claim 1 encoding a human NHL protein 
5 which comprises the amino acid sequence 





MPKIVLNGVT 


VDFPFQPYKC 


QQEYMTKVLE CLQQKVNGIL 


ESPTGTGKTL 


CLLCTTLAWR 




EHLRDGISAR 


KIAERAQGEL 


FPDRALSSWG 


NAAAAAGDPI 


ACYTDIPKII 


YASRTHSQLT 




QVINELRNTS 


YRPKVCVLGS 


REQLCIHPEV KKQESNHLQI 


HLCRKKVASR 


SCHFYNNVEE 




KSLEQELASP 


ILDIEDLVKS 


GSKHRVCPYY 


LSRNLKQQAD 


IIFMPYNYLL 


DAKSRRAHNI 


10 


DLKGTWIFD 


EAHNVEKMCE 


ESASFDLTPH 


DLASGLDVID 


QVLEEQTKAA 


QQGEPHPEFS 




ADSPSPGLNM 


ELEDIAKLKM 


ILLRLEGAID 


AVELPGDDSG 


VTKPGSYIFE 


LFAEAQITFQ 




TKGCILDSLD 


QIIQHLAGRA 


GVFTNTAGLQ 


KLADIIQIVF 


SVDPSEGSPG 


SPAGLGALQS 




YKVHIHPDAG 


HRRTAQRSDA 


WSTTAARKRG 


KVLSYWCFSP 


GHSMHELVRQ 


GVRSLILTSG 




TLAPVSSFAL 


EMQIPFPVCL 


ENPHIIDKHQ 


IWVGWPRGP 


DGAQLSSAFD 


RRFSEECLSS 


15 


LGKALGNIAR 


WPYGLLIFF 


PSYPVMEKSL 


EFWRARDLAR 


KMEALKPLFV 


EPRSKGSFSE 




TISAYYARVA 


APGSTGATFL 


AVCRGKASEG 


LDFSDTNGRG 


VIVTGLPYPP 


RMDPRWLKM 




QFLDEMKGQG 


GAGGQFLSGQ 


EWYRQQASRA VNQAIGRVIR 


HRQDYGAVFL 


CDHRFAFADA 




RAQLPSWVRP 


HVRVYDNFGH 


VIRDVAQFFR VAERTMPAPA 


PRATAPSVRG 


EDAVSEAKSP 




GPFFSTRKAK 


SLDLHVPSLK 


QRSSGSPAAG 


DPESSLCVEY 


EQEPVPARQR 


PRGLLAALEH 


20 


SEQRAGSPGE 


EQAHSCSTLS 


LLSEKRPAEE 


PRGGRKKIRL 


VSHPEEPVAG 


AQTDRAKLFM 




VAVKQELSQA 


NFATFTQALQ 


DYKGSDDFAA 


LAACLGPLFA 


EDPKKHNLLQ 


GFYQFVRPHH 




KQQFEEVCIQ 


LTGRGCGYRP 


EHSIPRRQRA 


QPVLDPTGRT 


APDPKLTVST 


AAAQQLDPQE 




HLNQGRPHLS 


PRPPPTGDPG 


SQPQWGSGVP 


RAGKQGQHAV 


SAYLADARRA 


LGSAGCSQLL 




AALTAYKQDD 


DLDKVLAVLA 


ALTTAKPEDF 


PLLHRFSMFV 


RPHHKQRFSQ 


TCTDLTGRPY 


25 


PGMEPPGPQE 


ERLAVPPVLT 


HRAPQPGPSR 


SEKTGKTQSK 


ISSFLRQRPA 


GTVGAGGEDA 




GPSQSSGPPH 


GPAASEWGL* 


(SEQ ID NO: 


:2) - 







3. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 2. 

30 

4. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 3. 
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5. A process for expressing a NHL protein in a recombinant host cell, 
comprising: 

(a) transfecting the expression vector of claim 3 into a suitable host cell; and, 
5 (b) culturing the host cells of step (a) under conditions which allow 

expression of said NHL protein from said expression vector. 

6. A purified DNA molecule encoding a human NHL protein which 
consists of the amino acid sequence 

10 MPKIVLNGVT VDFPFQPYKC QQEYMTKVLE CLQQKVNGIL 

EHLRDGISAR KIAERAQGEL FPDRALSSWG NAAAAAGDPI 

QVINELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI 

KSLEQELASP ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD 

DLKGTWIFD EAHNVEKMCE ESASFDLTPH DLASGLDVID 
15 ADSPSPGLNM ELEDIAKLKM ILLRLEGAID AVELPGDDSG 

TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF 

YKVHIHPDAG HRRTAQRSDA WSTTAARKRG KVLSYWCFSP 

TLAPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGWPRGP 

LGKALGNIAR WPYGLLIFF PSYPVMEKSL EFWRARDLAR 
20 TISAYYARVA APGSTGATFL AVCRGKASEG LDFSDTNGRG 

QFLDEMKGQG GAGGQFLSGQ EWYRQQASRA VNQAIGRVIR 

RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR VAERTMPAPA 

GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG DPESSLCVEY 

SEQRAGSPGE EQAHSCSTLS LLSEKRPAEE PRGGRKKIRL 
25 VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA 

KQQFEEVCIQ LTGRGCGYRP EHSIPRRQRA QPVLDPTGRT 

HLNQGRPHLS PRPPPTGDPG SQPQWGSGVP RAGKQGQHAV 

AALTAYKQDD DLDKVLAVLA ALTTAKPEDF PLLHRFSMFV 

PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK 
30 GPSQSSGPPH GPAASEWGL* (SEQ ID NO: 2). 

7. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 6. 
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ESPTGTGKTL 
ACYTDIPKII 
HLCRKKVASR 
IIFMPYNYLL 
QVLEEQTKAA 
VTKPGSYIFE 
SVDPSEGSPG 
GHSMHELVRQ 
DGAQLSSAFD 
KMEALKPLFV 
VIVTGLPYPP 
HRQDYGAVFL 
PRATAPSVRG 
EQEPVPARQR 
VSHPEEPVAG 
EDPKKHNLLQ 
APDPKLTVST 
SAYLADARRA 
RPHHKQRFSQ 
ISSFLRQRPA 



CLLCTTLAWR 
YASRTHSQLT 
SCHFYNNVEE 
DAKSRRAHNI 
QQGEPHPEFS 
LFAEAQITFQ 
SPAGLGALQS 
GVRSLILTSG 
RRFSEECLSS 
EPRSKGSFSE 
RMDPRWLKM 
CDHRFAFADA 
EDAVSEAKSP 
PRGLLAALEH 
AQTDRAKLFM 
GFYQFVRPHH 
AAAQQLDPQE 
LGSAGCSQLL 
TCTDLTGRPY 
GTVGAGGEDA 
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8. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 7. 

9. A process for expressing a NHL protein in a recombinant host cell, 
5 comprising: 

(a) transfecting the expression vector of claim 7 into a suitable host cell; and, 

(b) culturing the host cells of step (a) under conditions which allow 
expression of said NHL protein from said expression vector. 

10 10. A purified DNA molecule which comprises the nucleotide sequence as 

set forth in SEQIDNO:L 

11. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 10. 

15 

12. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 11. 

13. A purified DNA molecule which consists of the nucleotide sequence as 
20 set forth in SEQ ID NO: 1 . 

14. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 13. 

25 15. A host cell which expresses a recombinant NHL protein wherein said 

host cell contains the expression vector of claim 14. 

16. A purified DNA molecule of claim 13 which consists of the nucleotide 
sequence from about nucleotide 828 to about nucleotide 4587, as set forth in SEQ ID 

30 NO:l. 

17. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 16. 
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18. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 17. 

19. A substantially purified NHL protein which comprises the amino acid 
5 sequence as set forth in SEQ ID NO: 2. 



20. A substantially purified NHL protein which consists of the amino acid 
sequence as set forth in SEQ ID NO:2. 



10 21. A substantially purified NHL protein which comprises the amino acid 

sequence as set forth in SEQ ID NO:2, wherein said protein is a product of a DNA 
expression vector comprising SEQ ID NO:l and contained within a recombinant host 
cell. 



15 22. A method of identifying modulators of NHL activity, comprising: 

(a) combining a test compound with a NHL protein, wherein NHL comprises 
the amino acid sequence as set forth in SEQ ID NO:2; and, 

(b) measuring the effect of the test compound on the NHL protein. 

20 23. An isolated DNA molecule which comprises the nucleotide sequence 

as set forth in SEQ ID NO:3. 

24. An isolated DNA molecule of claim 20 which comprises from about 
nucleotide 47000 to about nucleotide 85500 of SEQ ID NO:3. 

25 

25. An isolated DNA molecule of claim 23 which comprises from about 
nucleotide 47095 to about nucleotide 85316 of SEQ ID NO:3. 



26. A substantially purified NHL protein of claim 21 wherein said protein 
30 is a product of a DNA expression vector comprising from about nucleotide 828 to 
nucleotide 4587, as set forth in SEQ ID NO:l, and contained within a recombinant 
host cell. 
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AGTCAGCCCT GCTGCCAGCC AGT6CC6GGT GCTGGGGACT CAGGGAGGCC CGCCGGGACC ACTGCGGGAC 
AGTGAGCCGA GCAGAAGCTG GAACGCAGGA GAGGAAGGAG AGGGGGCGGT CAGGGCTCTC AGGAGCCGGG 
?Stgggcaa GGCGCAGCCG TTTTCAAATT TTCAGGAAAG cggtcggctc acactcgagc agtamaaga 
TGCCTCTGGG GAGGAGGCCC GTGCAGCTCT CCGGGCAATG GTGGTGGCTC GGCCTAGAGA GaGGTAGTG 

gaacgcagac cctggtgggg gaatgacatc aagggaggag acgggcggga ccccagattt ctgcctgtgg 

GCGATGGAAG TGAGGTTCAC TGGCCAGCGG AGCCGGACAC AGAACGCGCA AAACGCCGTG TAGGOCTGGA 
GGAGCCGAAG AGCAGGCGGA CCCCCTCCGC GGGGGAACAG THCCGCCGG GAGCACAAAG CMCGGACCG 
GAAGTGGGGG GCGGAAGTGC AGTGGGCTCA GCGCCGACTG CGCGCCTCTG CCCGCGAAAA CTCTGAGCTG 
GCTGACAGCT GGGGACGGGT GGCGGCCCTC GACTGGAGTC GGTTGAGTTC CTGAGGGACC CCGGTTCTGG 
AAGGTTCGCC GCGGAGACAA GTGAGCAGTC TGTGCCATAG GGATTCTCGA AGAGAACAGC G^GTCCC 
ATTCTArATP rrfGCATCGC TTACCAGGAG TGCCCGAGAC CCTAAGATGT TCGGAGTGGT TTTTTCGCAC 

i2SSJ5 Sctgcccc tcagccacgc tctgtgccct tctgagaaca ggctgataig CCCAAGATAG 
S %S SctSot Sccta caaatgccaa caggagtaca tgaccaaggt 

S CTGCAGCAGA AGGTGAATGG CATCCTGGAG AGCCCTACGG GTACAGGGAA GACGCTGTGC 

ctgctgtgca ccacgctggc ctggcgagaa cacctccgag acggcatctc tgcccgcaag attgccgaga 

GGGCGCAAGG AGAGCTTTTC CCGGATCGGG CCTTGTCATC CTGGGGCAAC GCTGCTGCTG CTGCTGGAGA 
CCCCATAGCT T6CTACACGG ACATCCCAAA GATTATTTAC 6CCTCCAGGA CCCACTCGCA ACTCACACAG 
GTCATCAACG AGCTTCGGAA CACCTCCTAC CGGCCTAAGG TGTGTGTGCT GGGCTCCCGG ^CAGCTGT 
GCATCCATCC TGAGGTGAAG AAACAAGAGA GTAACCATCT ACAGATCCAC TTGTGCCGTA AGAAGGTGGC 

SgtStcc TGTCATTTCT acaacaacgt AGAAGAAAAA agcctggagc aggagctggc cagccccatc 

CTGGACATTG AGGACTTGGT CAAGAGCGGA AGCAAGCACA GGGTGTGCCC TTACTACCTG TCCCGGAACC 

Sagca Sgacatc atattcatgc cgtacaatta cttgttggat GCCMGAGCC gcagagcaca 

CAACATTGAC CTGAAGGGGA CAGTCGTGAT CTTTGACGAA GCTCACAACG TGGAGAAGAT GTGTGAAGAA 
TCGGCATCCT TTGACCTGAC TCCCCATGAC CTGGCTTCAG GACTGGACGT CATAGACCAG GTGCTGGAGG 
AGCAGACCAA GGCAGCGCAG CAGGGTGAGC CCCACCCGGA GTTCAGCGCG GACTCCCCCA GCCCAGGGCT 
GAACATGGAG CTGGAAGACA TTGCAAAGCT GAAGATGATC CTGCTGCGCC TGGAGGGGGC CATCGATGCT 
GTTGAGCTGC CTGGAGACGA CAGCGGTGTC ACCAAGCCAG GGAGCTACAT CTTTGAGCTG TTTGCTGAAG 
CCCAGATCAC GTTTCAGACC AAGGGCTGCA TCCTGGACTC GCTGGACCAG ATCATCCAGC ACCTGGCAGG 
ACGTGCTGGA GTGTTCACCA ACACGGCCGG ACTGCAGAAG CTGGCGGACA TTATCCAGAT TGTGTTCAGT 
GTGGACCCCT CCGAGGGCAG CCCTGGTTCC CCAGCAGGGC TGGGGGCCTT ACAGTCCTAT AAGGTGCACA 
TCCATCCTGA TGCTGGTCAC CGGAGGACGG CTCAGCGGTC TGATGCCTGG AGCACCACTG CAGCCAGAM 
GCGAGGGAAG GTGCTGAGCT ACTGGTGCTT CAGTCCCGGC CACAGCATGC ACGAGCTGGT CCGCCAGGGC 
GTCCGCTCCC TCATCCTTAC CAGCGGCACG CTGGCCCCGG TGTCCTCCTT TGCTCTGGAG ATGCAGATCC 
CTTTCCCAGT CTGCCTGGAG AACCCACACA TCATCGACAA GCACCAGATC TGGGTGGGGG TCGTCCCCAG 
AGGCCCCGAT GGAGCCCAGT TGAGCTCCGC GTTTGACAGA CGGTTTTCCG AGGAGTGCTT ATCCTCCCTG 
GGGAAGGCTC TGGGCAACAT CGCCCGCGTG GTGCCCTATG GGCTCCTGAT CTTCHCCCT TCCTATCCTG 
TCATGGAGAA GAGCCTGGAG TTCTGGCGGG CCCGCGACTT GGCCAGGAAG ATGGAGGCGC TGAAGCCGCT 
GTTTGTGGAG CCCAGGAGCA AAGGCAGCTT CTCCGAGACC ATCAGTGCTT ACTATGCAAG GGTTGCCGCC 
CCTGGGTCCA CCGGCGCCAC CTTCCTGGCG GTCTGCCGGG GCAAGGCCAG CGAGGGGCTG GACTTCTCAG 
ACACGAATGG CCGTGGTGTG ATTGTCACGG GCCTCCCGTA CCCCCCACGC ATGGACCCCC GGGTTGTCCT 
CAAGATGCAG TTCCTGGATG AGATGAAGGG CCAGGGTGGG GCTGGGGGCC AGTTCCTCTC TGGGCAGGAG 
TGGTACCGGC AGCAGGCGTC CAGGGCTGTG AACCAGGCCA TCGGGCGAGT GATCCGGCAC CGCCAGGACT 
ACGGAGCTGT CTTCCTCTGT GACCACAGGT TCGCCTTTGC CGACGCAAGA GCCCAACTGC CCTCCTGGGT 
GCGTCCCCAC GTCAGGGTGT ATGACAACTT TGGCCATGTC ATCCGAGACG TGGCCCAGTT CTTCCGTGTT 
GCCGAGCGAA CTATGCCAGC GCCGGCCCCC CGGGCTACAG CACCCAGTGT GCGTGGAGAA GATGCTGTCA 
GCGAGGCCAA GTCGCCTGGC CCCnCHCT CCACCAGGAA AGCTAAGAGT CTGGACCTGC ATGTCCCCAG 
CCTGAAGCAG AGGTCCTCAG GGTCACCAGC TGCCGGGGAC CCCGAGAGTA GCCTGTGTGT GGAGTATGAG 
CAGGAGCCAG HCCTGCCCG GCAGAGGCCC AGGGGGCTGC TGGCCGCCCT GGAGCACAGC GAACAGCGGG 
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CGG6GA6CCC TG6CGAGGAG CAGGCCCACA GCTGCTCCAC CCTGTCCCTC CTGTCTGAGA 
AGAGGCCGGC AGAAGAACCG CGAGGAGGGA GGAAGAAGAT CCGGCTGGTC AGCCACCCGG 
AGGAGCCCGT GGCTGGTGCA CAGACGGACA GGGCCAAGCT CTTCATGGTG GCCGTGAAGC 
AGGAGTTGAG CCAAGCCAAC TTTGCCACCT TCACCCAGGC CCTGCAGGAC TACAAflffiTT 
CCGATGACTT CGCCGCCCTG GCCGCCTGTC TCGGCCCCCT CTTTGCTGAG GACCCCAAGA 
AGCACAACCT GCTCCAAGGC TTCTACCAGT TTGTGCGGCC CCACCATAAG CAGCAGTTTG 
AGGAGGTCTG TATCCAGCTG ACAGGACGAG GCTGT6GCTA TCGGCCTGAG CACAGCATTC 
CCCGAAGGCA GCGGGCACAG CCGGTCCTGG ACCCCACTGG AAGAACGGCG CCGGATCCCA 
AGCTGACCGT GTCCACGGCT GCAGCCCAGC AGCTGGACCC CCAAGAGCAC CTGAACCAGG 
GCAGGCCCCA CCTGTCGCCC AGGCCACCCC CAACAGGAGA CCCTBQCJGC CAJCCACACT 
GGGGGTCTGG AGTGCCCAGA GCAGGGAAGC AGGGCCAGCA CGCCGTGAGC GCCTACCTGG 
CTGATGCCCG CAGGGCCCTG GGGTCCGCGG GCTGTAGCCA ACTCTTGGCA GCGCTGACAG 
CCTATAAGCA AGACGACGAC CTCGACAAGG TGCTGGCTGT GTTGGCCGCC CTGACCACTG 
CAAAGCCAGA GGACTTCCCC CTGCTGCACA GGTTCAGCAT GTTTGTGCGT CCACACCACA 
AGCAGCGCTT CTCACAGACG TGCACAGACC TGACCGGCCG GCCCTACCCG GGCATGGAGC 
CACCGGGACC CCAGGAGGAG AGGCTTGCCG TGCCTCCTGT GCTTACCCAC AGGGCTCCCC 
AACCAGGCCC CTCACGGTCC GAGAAGACCG GGAAGACCCA GAGCAAGATC TCGTCCTTCC 
TTAGACAGAG GCCAGCAGGG ACTGTGGGGG CGGGCGGTGA GGATGCAGGT CCCAGCCAGT 
CCTCAGGACC TCCCCACGGG CCTGCAGCAT CTGAGTGGGG CCTCJAGGAT GTGCCCAGCC 
TGCCACACCG CCTCCAGGAA GCAGAGCGTC ATGCAGGTCT TCTGGCCAGA GCCCCAGTGA 
GTGCCCACGG AGGCCCCCAG CACACCCAAC GTGGCTTGAT CACCTGCCTG TCCAGCTCTG 
GTGGGCCAAG AACCCACCCA ACAGAATAGG CCAGCCCATG CCAGCCGGCT TGGCCCGCTG 
CAGGCCTCAG GCAGGCGGGG CCCATGGTTG GTCCCTGCGG TGGGACCGGA TCTGGGCCTG 
CCTCTGAGAA GCCCTGAGCT ACCTTGGGGT CTGGGGTGGG TTTCTGGGAA AGTGCTTCCC 
CAGAACTTCC CTGGCTCCTG GCCTGTGAGT GGTGCCACAG GGGCACCCCA GCTGAGCCCC 
TCACCGGGAA GGAGGAGACC CCCGTGGGCA CGTGTCCACT TTTAATCAGG GGACAGGGCT 
CTCTAATAAA GCTGCTGGCA GTGCCC (SEQ ID N0:1). 
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MPKIVLNGVT VDFPFQPYKC QQEYWTKVLE CLQQKVNGIL ESPTGTGKTL CLLCTTLAWR 
EHLRDGISAR KIAERAQGEL FPDRALSSWG NAAAAAGDPI ACYTDIPKII YASRTHSQLT 
QVINELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI HLCRKKVASR SCHFYNNVEE 
KSLEQELASP ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD IIFMPYNYLL DAKSRRAHNI 
DLKGTVVIFD EAHNVEKMCE ESASFDLTPH DLASGLDVID QVLEEQTKAA QQGEPHPEFS 
ADSPSPGLNM ELEDIAKLKM ILLRLEGAID AVELPGDDSG VTKPGSYIFE LFAEAQITFQ 
TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF SVDPSEGSPG SPAGLGALQS 
YKVHIHPDAG HRRTAQRSDA WSTTAARKRG KVLSYWCFSP GHSMHELVRQ GVRSLILTSG 
TLAPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGVVPRGP DGAQLSSAFD RRFSEECLS5 
LGKALGNIAR VVPYGLLIFF PSYPVMEKSL EFWRARDLAR KMEALKPLFV EPRSKGSFSE 
TISAYYARVA APGSTGATFL AVCRGKASEG LDFSDTNGRG VIVTGLPYPP RMDPRVVLKM 
QFLDEMKGQG GAGGQFLSGQ EWYRQQASRA VNQAIGRVIR HRQDYGAVFL CDHRFAFADA 
RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR VAERTMPAPA PRATAPSVRG EDAVSEAKSP 
GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG DPESSLCVEY EQEPVPARQR PRGLLAALEH 
SEQRAGSPGE EQAHSCSTLS LLSEKRPAEE PRGGRKKIRL VSHPEEPVAG AQTDRAKLFM 
VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA EDPKKHNLLQ GFYQFVRPHH 
KQQFEEVCIQ LTGRGCGYRP EHSIPRRQRA QPVLDPTGRT APDPKLTVST AAAQQLDPQE 
HLNQGRPHLS PRPPPTGDPG SQPQWGSGVP RAGKQGQHAV SAYLADARRA LGSAGCSQLL 
AALTAYKQDD DLDKVLAVLA ALTTAKPEDF PLLHRFSMFV RPHHKQRFSQ TCTDLTGRPY 
PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 
GPSQSSGPPH GPAASEWGL* (SEQ ID NO: 2). 

FIG. 2 



SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



4/12 



PCT/US00/33065 



>- 



CO 



>- >- 



o o o o o 

. — I — > — I — I CO 
Iz Z Z Z ll 



CO CO CO < o 



CO CO CO CO CJ 

> :> ~ > zn 




^1 



Q Q Q Q CJ 
CO I — CO I — o 

^ I o: u 

V V _l 



X 

o 

CO 

o 



m 

UJ 
Q 




CO CO (0 < — < 

12. ■ ' 




O 
UJ 
I 

o 




Ld UJ Ld Ld O 
d d CL H ^ 




>- _ J — I >- Q 
I Q_ G II— 



_J -J J J > 
LU|Q Q O Z 1 



_>->->- Z > 
hi 1 1 i . i 



m 

QlQ O 
Q) < < 

a: or QT 



O 
CL 

O 

°i 

Q_ 
X 



CO 



_ ^ >- 

Q O < 

uoo Cl 
^ co ^: q 

Ld O 



C£ OH Ctl >- 
>- >- 
O Q 



< 
< 



or UJ ^ 

<c — I — i 



^1 





> 




o ^ ^ ^ cc 

LjJ Ld Q L±J Q 



Ld Ld Ld Ul 
Q_ CO CO CL 
> ^ Z§ > 



UJ 

■ . . . ca 

cr cr cr cr o 

CO CO CO co < 

uuuuo: 

>->->->- Ld 
>■— ' » — ■ >— <£ 

71 




(/) I I X 

Ld Ld UJ CO 




Q 
CL 
D 
O 

ID i_ 

Q iO ^- O 
CLQ Q I _J 

<D < < CL X 




o o 

uj a: uj < — * <i. 

o: ^ k k o i>o 

z < > > -J 

CO O O CO Ld Cl> 





1— 


\— 


h- 


col 


_l 




_i 


_J 


o 


Ld 


a: 


CO 


CO 


_J 


(Y 


a: 


cc 


X 





< Ld < O 

O O O O 

> > > > 

> > > O 
^ h- z ^ 

o o o o 

Q ^ ~~ 









Ld UJ UJ 












CO CO Gl 


1— 




l*> >• > 


> 




or: uj co 


Ld 


AO 


CL CL Q. 


CL 


O X X 


X 


1— 


I ■ — ' kJ\ix\i i • k-X«i 










Cl 












O 




lO 






Q rO <- 


°, 




Q. O Q 




_J 


<D < < 


CL 


X 


cc en en 


X 


Z 



SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



~ > > > 5 

CO UJ LjJ Q C£ 

< < < < a: 
► ——■—■—«(/) 

£ g GT< 




(/)(/)(/) Ql 
>->->- 
~ ~ > 



Z Z Z Z Q 



< o u < < 

u I Ll I O 

Z (/) Q i O 

i q: tr >- 




uj u o: co 
>- >- _j co 

LdEglUJ < ^ 




o o b o u. 

Ld Z < CO 
I Q. CO Q- < 

I >- — I Q_ Ld 

> q_ _j 

O U UJ o CO 
Z ' — ' — I X ^£ 
CO Z Q < U 



5/12 



PCT/US00/33065 



en ^1 CH I 

^: o ^ Q Q- 
loo co kjwui 

Q Q Q O 

\£ C£ V£ ^ < 

Q LxJ UJ IO: ^ 
l_dfCOl2 
— ~ > > 
C!f CC ^ h- Ld 

Iv uj O ^ mi 

3? Q UJ C5 -J 
I' — ' — j — 1 — 1 p^l 

v <f co i — O 



UJ Q, 

i<ra=L^ 



o w z > 
o: ^ o Q 

COO J 

- < u . 
Bo: ^ o^fcoi 



CO 
CO 



CO I — Q 
^ Q CO K I 
UJ 0^ Q- 

Q 




z z z z ^ 

O O O . O UJ 




CO_JZ2JO <Lt=J 
ZOO*: I 

co ^ ^ o: o 

0- CO CO < ^ 



CO lUJ UJ UJ Q 



CO CO CO CO 2 



_J _J _l > < 

>- >->->- UJ 
Q U Ld UJ < 
> —■ > JLl 

[^ESu-ESSl 
tr cc Qt q: uj 
rv v v cr: u. 

_J U- U- u. _>i. 



WBSm 



< Ol 



U_ Lu Lu Lu Q_ 

— XX X^ 

[col lu uj LdFE3 

qT< < < > 

^ Dd I — O 

cr q: ct: ct: co 




< < < < Ld 
UJ UJ Ld LxJ > 
OH ^ 0< 
— i -i —i Q L 

Q Q Q Ld < 
Z O Ld O O 

n n_ Q- Q- Ld 

> > > > ^ 

Q_ Q_ CL Q 1 

QI-ZZ-J 
CO 1 1 1 <C <■ '— ■ 

I — u_ u. x ^ 

UJ CL O < — 1 

O UJ o o ^ 

I Ld UJ I — <C 

cc l— z q: uj 
h-Q]< <0 

(/)—.<< UJ 
O O Z CO S 
CO < < < z 
a: co o Ld — i 
< x o ct: o 

I I — I — ' Cl. 

o o o o CO 

Z O 0u Ld Q_ 
> > > > CO 

_J _l _) — I Q 

^ ^ cc* < 
iV- >- >- >- u_i 



Q Q U U U. 

o a v. xi 



GD 



Q 
< 



Q 

CL 

O 

o 



Q-O 
<L> < 
(TKKXZ 



O 
CL 

O 

m l_ 

Q fO O 
CL O O I -J 

a> < < a. x 
or ct o: x z 



O 

CL 
O 

IT) u. 
Q lO «~ Q 
a. O Q I — ' 
«D <C < Q_ X 

cc IX x z 



WO 01/42434 



I 

o 

CO 

b_ 

o 
z 
> 
o 
z 

I— » 

or 



D 



Q O 
LU LU 
> > 
LU -J 
h- CO 

Ld O 



I — 1 — <C 
OH OH OH 
> > 



CO CL 
X CO CL 
b_ CO 
Q O 
< CL 

1 CO 

Q O 
UJ 
CO 
CL 
O 
> 
CO 



Ld 



_1 _ I _J — J > 



CO _l CO 
CO CO 



CO ~ 

oh o 



rr oh on 

COlLU LU Ld 



UUUU 

Lu_ 



^ a: g k o 

I — Q. CL CL O 
C£ b£ ^ ^ < 

nolo: Ng: eg 



CO LUfQlO 



< 
x 



_j ~ ct o 

O Q O < 




< CO CO < o 
J ^ I- Q. Q 
d CL d d 



lil LU LU LU Q 



CO CO <C CJ 

S X X X o 
p J -i 

[t m> > >r R 
^ ^ ^ cr o 



c^ cc c£ c£ i- 
co I — i — ^ 
^ k gTTTI 



O 
CL 
13 
O 

m L. 

Q PO r- (J) 

CL Q Q I _J 
LU < < Q- X 
OH OH OH X Z 



6/12 



PCT/US00/33065 




Q LU LU LU OH 

Q_ CL CL LU 

st: ^: ^ ^ x 
? ~ ~ S 



CO 

-H — — _l X 

CO CO CO CO o 

< < < < CL 



CO CO 




O f ^ CO CD Cg 

z Lur ~" 



I 



LU LU| 



>- >- >- >- O 
i — i — i — i — cl 



l> 


h— 1 




ATL 


_l 

< 


1 — 1 X 

1 — 1 — > 

< < ^ 




Lu 


Li_ >-| 


Ad 


1— 
< 


> < o 






> -1 -Jl 



(/) ^ O I — o 



Q 

CL 

o 

lO i- 

tn *- <j> 
< < 



o 

CL 
LU 

OH OH OH 



CL 



I— CL CL — LU 
QQQQI/) 

' ~ < CE 




Ld Ld Ld. . 



v fv y ^ < 



CO < < < u 
— > > > < 




□_ Cl Cl □_ 
O — i —i O 




CO I — CO 

^ ^ 2 2 x 

H < O H Ql 

_J >- >- L_ Z 

CE CO CO I — LU 

Ld i£ LU < — I 
O O 



l^> > > CL 

Q_ I — l — Cl u_ 



LNFI 


LNF 


Lu Li_ 
O Q 
_l _J 


5 


TKN/fi 


PRMI 


Q_ CL 


LU 
_J 

<L, 


l>- 






L_| 


Q 


o 




SS 


1 _! 1 1 J> 

vTs. v/> Vft> 'vh < 



CO CO CO CO < 




Q 
Q. 

O 

o n o 
Cl Q Q I 
LU < < CL X 
OH OH OH X Z 



o 
to 

o 



SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



7/12 



PCT/US00/33065 



Q. < Q. O ^ 
QQQQ(/1 
CO Q_ Q_ O oh 




-T _l I — I f*L 

I- V, ^ 0£ < 
Ou 



> > ~ ^ 

7 U LJ 2 Y 



Z Q Q Ll) 

_i -I -I -J < 

I I ■ — ■ < — ■ —II 

O O O O Q 

^ ^ 2 O OH 

Ld I— CO Ld < 

^ o nr >- rv 



j (7) (/) 



ro (/i (/) < lu 



l> > >.^>~ 




>- >- >- >r > 
(/I _l ! — I O Q- 





v v. 




> — >'-' cc 

v. o. 

co co co 00 o 

Ld Ld Ld O 





< < < <L O 




W n Q P O 
i v v v. ^: oil 




o o o o co 

CO Z Z Z O J_ 
Q (/)(/) Ld CL 



Ld LJ LU 
, >> JQ 

_j ^ — > -i < 
>- > >- >= — 1 

Z Z Z Z CO 

o: ce ce co 

< CO CO < Ld 













< 



Z I— 1- ^ < 

O Ld Ld Ld CO 



<f <C < < I- 



Ld Ld Ld Ld CO 
< O X < O 



Ld Ld Ld Ld O 
OH OH OH OH O 

1 — . 1 — 1 1 — 11 — 1 o 
00:000 

>- >- >- L_ ^ 
fV 7 I- Q S 

W 




Ld Ld Ld Ld 



rv eg ^ 

< < < -j 

rv v v V. > 



O 
CL 
Ld 



Q 
CL 

O 



LO 



< < 

OH OH OH 



Q. 
X 



d 
CL 
Ld 



Q 
< 



o 
< 



Q 
cl 

O 

o 
I 

Cl 
X 



on oh oh 

SUBSTITUTE SHEET (RULE 26) 



Q 
Cl 
D 
O 

lO 

Q <- O 
d Q Q I -J 
Ld < < CL X 

o: a o: x z 



WO 01/42434 



PCT/US00/33065 



X Q Q 

y u Q 

— i >- — ' 

co co co 



o co 

LU > 
— I < 
10 Q 
_J UJ 
— i o 



O O O 



O 

UJ 



O 
Q 

Q_ 
Q 



O 
Q 
CO 
< 



O CO 
_) CL 

o < 
lu <C 



.. I — U- <C 

o boo < 
" < < Q- 




co 

CO CO 



< < 

Q Q 

h- I — 

CO CO 



_J > < 
CO o > 



h- i — i — 



CO 



IZI 



X o 

o < 

CL Q 

O CO 



> 

X 

12. 



O < Q 

Ld Q >- 

I — I — > 
J I C£ 



O 
< 



O UJ Q_ 

O OK 




or 



CO O 
CO O iQcpjtSI 



X o 

I — O LU 

Ld S CO 

CO UJ v — ^ 



<: u_ 



o 



o 



CO 



co| 



UJ 



O K) 
Q_ O 
UJ < 



IT) 

Q 
< 



Q 
CL 

O 

o 



[i: a o: x 



Q_ X 



CL O 
UJ < 



Q 
CL 

O 



LO 



< 



CL X 



Ql CC Od X ~Z. 



SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



PCT/US00/33065 



ct 
o 
h— 

o 
< 

> 

CL 
LU 
LJ 
CL 
X 
CO 

> 

_l 

ct 



o 
>- 

Li_ 

8 



9/12 

o o . 

O < I 

< CO 

< o 

< _j 
\— < 

CO Ct 



< 

> 



X 

it 
it 

CL 
Q 
LU 
< 



it CL 

it e> 

ct _i 

o o 

o < 

Ct < 
CL 
LU 
LJ 

< L_ 

Q_ Q 

Ct Q 



< 
< 



it CO 
LJ O 
CO it 
—I >- 
__l Q 

co o 
_i _j 

i — < 

CO o 
O I — 
CO u_ 
X I— 

< < 

LJ ZL 
LJ < 
O O 
CL CO 
CO — I 
O LJ 



Ct 
< 

< 
_l 

< 
CO 

> 
< 

X 

o 
o 

< 
o ct 

< CL 

ct > 
o o 
ct co 
Ct o 

CL ^ 
CL 

o 

LU CO 



o 

CO 



it 

Cl 
Q 
CL 
< 
I— 
Ct 

I— 

CL 
Q 

I 

> 
CL 



3 

CL 
Ct 
Ct O 
O Ct 
it — I 



X 
CL 
Ct 
> 



CO 



CO 
CO 

» — » 

it 

CO 

o 



it 
ct o 



CL 

u_ 

LU 
CL 
it 
< 



CO 



CL O 
Ct CL 
>- Q 



o 
o 
ct 
o 



CL 
Cl 
Cl 
Ct 
Q_ 
O CO 



< 
Ct 

o 

LJ 
CO 



O 
it 
> 
< 
> 



O X 
> Cl 
LJ Ct 



< it 

< < 



O 

o 
it 

X 
X 
Cl 
Ct 
> 



8 



< 
< 
_i 
> 
< 

i 

> 
it 
o 
_l 
Q 
Q 
Q 
O 
it 
>- 
< 



it ^ 
LU CNJ 
CO • • 
Ct O 
CO Z 
D_ 

O Q 
CL — 

o 

CL 
< LJ 
Ct CO 



O 



> 
CL 
CL 
> 
< 



CO 

< 
< 

Ct Cl 
O 
X 
Cl 
Cl 



X 
LJ 
O 
CL 
Q 



< 
< 



o 
co 
o 



LU 

o 

CL 

o o 

CL CO 
CL CO 
LJ O 
S CO 
O CL 
CL O 
>- < 
CL Q 
Ct LJ 
O O 

I — o 



to 



X X X X X XX 

z. z. z. z. z z z 

SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



PCT/US00/33065 



10/12 




in 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



PCT/US00/33065 



12/12 




Q. 
Zi 

o 

CO 



5 



^ "53 * 
o > o 
c o c 



ro 
Ql 
o 
o 




GO 
CO 



SUBSTITUTE SHEET (RULE 26) 



WO 01/42434 



PCT/US00/33065 



SEQUENCE LISTING 
<110> APPLICANT: Merck & Co., Inc. 

<120> TITLE: DNA MOLECULES ENCODING HUMAN NHL, A DNA 
HELICASE 

<130> DOCKET/FILE REFERENCE: 20585 PCT 
<160> NUMBER OF SEQUENCES: 38 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 

<210> SEQ ID NO:l 

<211> LENGTH: 4946 ' 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME /KEY: CDS 

<222> LOCATION: (828) . . . (4487) 

<400> SEQ ID NO:l 

agtcagccct gctgccagcc agtgccgggt gctggggact cagggaggcc cgccgggacc 60 

actgcgggac agtgagccga gcagaagctg gaacgcagga gaggaaggag agggggcggt 120 

cagggctctc aggagccggg tcctgggcaa ggcgcagccg ttttcaaatt ttcaggaaag 180 

cggtcggctc acactcgagc agtaaaaaga tgcctctggg gaggaggccc gtgcagctct 240 

ccgggcaatg gtggtggctc ggcctagaga ggcggtagtg gaacgcagac cctggtgggg 300 

gaatgacatc aagggaggag acgggcggga ccccagattt ctgcctgtgg gcgatggaag 360 

tgaggttcac tggccagcgg agccggacac agaacgcgca aaacgccgtg taggcctgga 420 

ggagccgaag agcaggcgga ccccctccgc gggggaacag tttccgccgg gagcacaaag 480 

caacggaccg gaagtggggg gcggaagtgc agtgggctca gcgccgactg cgcgcctctg 540 

cccgcgaaaa ctctgagctg gctgacagct ggggacgggt ggcggccctc gactggagtc 600 

ggttgagttc ctgagggacc ccggttctgg aaggttcgcc gcggagacaa gtgagcagtc 660 

tgtgccatag ggattctcga agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc 720 

ttaccaggag tgcccgagac cctaagatgt tcggagtggt tttttcgcac agacccgaat 780 

agcctgcccc tcagccacgc tctgtgccct tctgagaaca ggctgat atg ccc aag 836 

Met Pro Lys 
1 

ata gtc ctg aat ggt gtg acc gta gac ttc cct ttc cag ccc tac aaa 884 
lie Val Leu Asn Gly Val Thr Val Asp Phe Pro Phe Gin Pro Tyr Lys 
5 10 15 

tgc caa cag gag tac atg acc aag gtc ctg gaa tgt ctg cag cag aag 932 
Cys Gin Gin Glu Tyr Met Thr Lys Val Leu Glu Cys Leu Gin Gin Lys 
20 25 30 35 

gtg aat ggc ate ctg gag age cct acg ggt aca ggg aag acg ctg tgc 980 
Val Asn Gly lie Leu Glu Ser Pro Thr Gly Thr Gly Lys Thr Leu Cys 
40 45 50 

ctg ctg tgc acc acg ctg gec tgg cga gaa cac etc cga gac ggc ate 1028 
Leu Leu Cys Thr Thr Leu Ala Trp Arg Glu His Leu Arg Asp Gly lie 
55 60 65 

tct gec cgc aag att gee gag agg gcg caa gga gag ctt ttc ccg gat 1076 
Ser Ala Arg Lys lie Ala Glu Arg Ala Gin Gly Glu Leu Phe Pro Asp 
70 75 80 
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egg gec ttg tea tec tgg ggc aac get get get get get gga gac ccc 1124 
Arg Ala Leu Ser Ser Trp Gly Asn Ala Ala Ala Ala Ala Gly Asp Pro 
85 90 95 

ata get tgc tac acg gac ate cca aag att att tac gee tec agg ace 1172 
lie Ala Cys Tyr Thr Asp lie Pro Lys lie He Tyr Ala Ser Arg Thr 
100 105 110 115 

cac teg caa etc aca cag gtc ate aac gag ctt egg aac acc tec tac 1220 
His Ser Gin Leu Thr Gin Val He Asn Glu Leu Arg Asn Thr Ser Tyr 
120 125 130 

egg cct aag gtg tgt gtg ctg ggc tec egg gag cag ctg tgc ate cat 1268 
Arg Pro Lys Val Cys Val Leu Gly Ser Arg Glu Gin Leu Cys He His 
135 140 145 

cct gag gtg aag aaa caa gag agt aac cat eta cag ate cac ttg tgc 1316 
Pro Glu Val Lys Lys Gin Glu Ser Asn His Leu Gin He His Leu Cys 
150 155 160 

cgt aag aag gtg gca agt cgc tec tgt cat ttc tac aac aac gta gaa 1364 
Arg Lys Lys Val Ala Ser Arg Ser Cys His Phe Tyr Asn Asn Val Glu 
165 170 175 

gaa aaa age ctg gag cag gag ctg gec age ccc ate ctg gac att gag 1412 
Glu Lys Ser Leu Glu Gin Glu Leu Ala Ser Pro He Leu Asp He Glu 
180 185 190 195 

gac ttg gtc aag age gga age aag cac agg gtg tgc cct tac tac ctg 1460 
Asp Leu Val Lys Ser Gly Ser Lys His Arg Val Cys Pro Tyr Tyr Leu 
200 205 210 

tec egg aac ctg aag cag caa gee gac ate ata ttc atg ccg tac aat 1508 
Ser Arg Asn Leu Lys Gin Gin Ala Asp He He Phe Met Pro Tyr Asn 
215 220 225 

tac ttg ttg gat gec aag age cgc aga gca cac aac att gac ctg aag 1556 
Tyr Leu Leu Asp Ala Lys Ser Arg Arg Ala His Asn He Asp Leu Lys 
230 235 240 

ggg aca gtc gtg ate ttt gac gaa get cac aac gtg gag aag atg tgt 1604 
Gly Thr Val Val He Phe Asp Glu Ala His Asn Val Glu Lys Met Cys 
245 250 255 

gaa gaa teg gca tec ttt gac ctg act ccc cat gac ctg get tea gga 1652 
Glu Glu Ser Ala Ser Phe Asp Leu Thr Pro His Asp Leu Ala Ser Gly 
260 265 270 275 

ctg gac gtc ata gac cag gtg ctg gag gag cag acc aag gca gcg cag 1700 
Leu Asp Val He Asp Gin Val Leu Glu Glu Gin Thr Lys Ala Ala Gin 
280 285 290 

cag ggt gag ccc cac ccg gag ttc age gcg gac tec ccc age cca ggg 1748 
Gin Gly Glu Pro His Pro Glu Phe Ser Ala Asp Ser Pro Ser Pro Gly 
295 300 305 

ctg aac atg gag ctg gaa gac att gca aag ctg aag atg ate ctg ctg 1796 
Leu Asn Met Glu Leu Glu Asp He Ala Lys Leu Lys Met He Leu Leu 
310 315 320 
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cgc ctg gag ggg gcc ate gat get gtt gag ctg cct gga gac gac age 1844 
Arg Leu Glu Gly Ala lie Asp Ala Val Glu Leu Pro Gly Asp Asp Ser 
325 330 335 

ggt gtc acc aag cca ggg age tac ate ttt gag ctg ttt get gaa gcc 1892 
Gly Val Thr Lys Pro Gly Ser Tyr lie Phe Glu Leu Phe Ala Glu Ala 
340 345 350 355 

cag ate acg ttt cag acc aag ggc tgc ate ctg gac teg ctg gac cag 1940 
Gin lie Thr Phe Gin Thr Lys Gly Cys lie Leu Asp Ser Leu Asp Gin 
360 365 370 

ate ate cag cac ctg gca gga cgt get gga gtg ttc acc aac acg gcc 1988 
lie He Gin His Leu Ala Gly Arg Ala Gly Val Phe Thr Asn Thr Ala 
375 380 385 

gga ctg cag aag ctg gcg gac att ate cag att gtg ttc agt gtg gac 2036 
Gly Leu Gin Lys Leu Ala Asp He He Gin He Val Phe Ser Val Asp 
390 395 400 

ccc tec gag ggc age cct ggt tec cca gca ggg ctg ggg gcc tta cag 2084 
Pro Ser Glu Gly Ser Pro Gly Ser Pro Ala Gly Leu Gly Ala Leu Gin 
405 410 415 

tec tat aag gtg cac ate cat cct gat get ggt cac egg agg acg get 2132 
Ser Tyr Lys Val His He His Pro Asp Ala Gly His Arg Arg Thr Ala 
420 425 430 435 

cag egg tct gat gcc tgg age acc act gca gcc aga aag cga ggg aag 2180 
Gin Arg Ser Asp Ala Trp Ser Thr Thr Ala Ala Arg Lys Arg Gly Lys 
440 445 450 

gtg ctg age tac tgg tgc ttc agt ccc ggc cac age atg cac gag ctg 2228 
Val Leu Ser Tyr Trp Cys Phe Ser Pro Gly His Ser Met His Glu Leu 
455 460 465 

gtc cgc cag ggc gtc cgc tec etc ate ctt acc age ggc acg ctg gcc 2276 
Val Arg Gin Gly Val Arg Ser Leu He Leu Thr Ser Gly Thr Leu Ala 
470 475 480 

ccg gtg tec tec ttt get ctg gag atg cag ate cct ttc cca gtc tgc 2324 
Pro Val Ser Ser Phe Ala Leu Glu Met Gin He Pro Phe Pro Val Cys 
485 490 495 

ctg gag aac cca cac ate ate gac aag cac cag ate tgg gtg ggg gtc 2372 
Leu Glu Asn Pro His He He Asp Lys His Gin He Trp Val Gly Val 
500 505 510 515 

gtc ccc aga ggc ccc gat gga gcc cag ttg age tec gcg ttt gac aga 2420 
Val Pro Arg Gly Pro Asp Gly Ala Gin Leu Ser Ser Ala Phe Asp Arg 
520 525 530 

egg ttt tec gag gag tgc tta tec tec ctg ggg aag get ctg ggc aac 2468 
Arg Phe Ser Glu Glu Cys Leu Ser Ser Leu Gly Lys Ala Leu Gly Asn 
535 540 545 

ate gcc cgc gtg gtg ccc tat ggg etc ctg ate ttc ttc cct tec tat 2516 
He Ala Arg Val Val Pro Tyr Gly Leu Leu He Phe Phe Pro Ser Tyr 
550 555 560 
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cct gtc atg gag aag age ctg gag ttc tgg egg gee cgc gac ttg gec 2564 
Pro Val Met Glu Lys Ser Leu Glu Phe Trp Arg Ala Arg Asp Leu Ala 
565 570 575 

agg aag atg gag gcg ctg aag ccg ctg ttt gtg gag ccc agg age aaa 2612 
Arg Lys Met Glu Ala Leu Lys Pro Leu Phe Val Glu Pro Arg Ser Lys 
580 585 590 595 

ggc age ttc tec gag ace ate agt get tac tat gca agg gtt gee gec 2660 
Gly Ser Phe Ser Glu Thr lie Ser Ala Tyr Tyr Ala Arg Val Ala Ala 
600 605 610 

cct ggg tec acc ggc gec ace ttc ctg gcg gtc tgc egg ggc aag gec 2708 
Pro Gly Ser Thr Gly Ala Thr Phe Leu Ala Val Cys Arg Gly Lys Ala 
615 620 625 

age gag ggg ctg gac ttc tea gac acg aat ggc cgt ggt gtg att gtc 2756 
Ser Glu Gly Leu Asp Phe Ser Asp Thr Asn Gly Arg Gly Val He Val 
630 635 640 

acg ggc etc ccg tac ccc cca cgc atg gac ccc egg gtt gtc etc aag 2804 
Thr Gly Leu Pro Tyr Pro Pro Arg Met Asp Pro Arg Val Val Leu Lys 
645 650 655 

atg cag ttc ctg gat gag atg aag ggc cag ggt ggg get ggg ggc cag 2852 
Met Gin Phe Leu Asp Glu Met Lys Gly Gin Gly Gly Ala Gly Gly Gin 
660 665 670 675 

ttc etc tct ggg cag gag tgg tac egg cag cag gcg tec agg get gtg 2900 
Phe Leu Ser Gly Gin Glu Trp Tyr Arg Gin Gin Ala Ser Arg Ala Val 
680 685 690 

aac cag gee ate ggg cga gtg ate egg cac cgc cag gac tac gga get 2948 
Asn Gin Ala He Gly Arg Val He Arg His Arg Gin Asp Tyr Gly Ala 
695 700 705 

gtc ttc etc tgt gac cac agg ttc gee ttt gee gac gca aga gee caa 2996 
Val Phe Leu Cys Asp His Arg Phe Ala Phe Ala Asp Ala Arg Ala Gin 
710 715 720 

ctg ccc tec tgg gtg cgt ccc cac gtc agg gtg tat gac aac ttt ggc 3044 
Leu Pro Ser Trp Val Arg Pro His Val Arg Val Tyr Asp Asn Phe Gly 
725 730 735 

cat gtc ate cga gac gtg gec cag ttc ttc cgt gtt gee gag cga act 3092 
His Val He Arg Asp Val Ala Gin Phe Phe Arg Val Ala Glu Arg Thr 
740 745 750 755 

atg cca gcg ccg gee ccc egg get aca gca ccc agt gtg cgt gga gaa 3140 
Met Pro Ala Pro Ala Pro Arg Ala Thr Ala Pro Ser Val Arg Gly Glu 
760 765 770 

gat get gtc age gag gec aag teg cct ggc ccc ttc ttc tec acc agg 3188 
Asp Ala Val Ser Glu Ala Lys Ser Pro Gly Pro Phe Phe Ser Thr Arg 
775 780 785 

aaa get aag agt ctg gac ctg cat gtc ccc age ctg aag cag agg tec 3236 
Lys Ala Lys Ser Leu Asp Leu His Val Pro Ser Leu Lys Gin Arg Ser 
790 795 800 
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tea ggg tea cca get gee ggg gac ccc gag agt age ctg tgt gtg gag 3284 
Ser Gly Ser Pro Ala Ala Gly Asp Pro Glu Ser Ser Leu Cys Val Glu 
805 810 815 

tat gag cag gag cca gtt cct gec egg cag agg ccc agg ggg ctg ctg 3332 
Tyr Glu Gin Glu Pro Val Pro Ala Arg Gin Arg Pro Arg Gly Leu Leu 
820 825 830 835 

gec gee ctg gag cac age gaa cag egg gcg ggg age cct ggc gag gag 3380 
Ala Ala Leu Glu His Ser Glu Gin Arg Ala Gly Ser Pro Gly Glu Glu 
840 845 850 

cag gec cac age tgc tec acc ctg tec etc ctg tct gag aag agg ccg 3428 
Gin Ala His Ser Cys Ser Thr Leu Ser Leu Leu Ser Glu Lys Arg Pro 
855 860 865 

gca gaa gaa ccg cga gga ggg agg aag aag ate egg ctg gtc age cac 3476 
Ala Glu Glu Pro Arg Gly Gly Arg Lys Lys lie Arg Leu Val Ser His 
870 875 880 

ccg gag gag ccc gtg get ggt gca cag acg gac agg gec aag etc ttc 3524 
Pro Glu Glu Pro Val Ala Gly Ala Gin Thr Asp Arg Ala Lys Leu Phe 
885 890 895 

atg gtg gee gtg aag cag gag ttg age caa gee aac ttt gec acc ttc 3572 
Met Val Ala Val Lys Gin Glu Leu Ser Gin Ala Asn Phe Ala Thr Phe 
900 905 910 915 

acc cag gee ctg cag gac tac aag ggt tec gat gac ttc gec gec ctg 3620 
Thr Gin Ala Leu Gin Asp Tyr Lys Gly Ser Asp Asp Phe Ala Ala Leu 
920 925 930 

gec gee tgt etc ggc ccc etc ttt get gag gac ccc aag aag cac aac 3668 
Ala Ala Cys Leu Gly Pro Leu Phe Ala Glu Asp Pro Lys Lys His Asn 
935 940 945 

ctg etc caa ggc ttc tac cag ttt gtg egg ccc cac cat aag cag cag 3716 
Leu Leu Gin Gly Phe Tyr Gin Phe Val Arg Pro His His Lys Gin Gin 
950 955 960 

ttt gag gag gtc tgt ate cag ctg aca gga cga ggc tgt ggc tat egg 3764 
Phe Glu Glu Val Cys lie Gin Leu Thr Gly Arg Gly Cys Gly Tyr Arg 
965 970 975 

cct gag cac age att ccc cga agg cag egg gca cag ccg gtc ctg gac 3812 
Pro Glu His Ser lie Pro Arg Arg Gin Arg Ala Gin Pro Val Leu Asp 
980 985 990 995 

ccc act gga aga acg gcg ccg gat ccc aag ctg acc gtg tec acg get 3860 
Pro Thr Gly Arg Thr Ala Pro Asp Pro Lys Leu Thr Val Ser Thr Ala 
1000 1005 1010 

gca gec cag cag ctg gac ccc caa gag cac ctg aac cag ggc agg ccc 3908 
Ala Ala Gin Gin Leu Asp Pro Gin Glu His Leu Asn Gin Gly Arg Pro 
1015 1020 1025 

cac ctg teg ccc agg cca ccc cca aca gga gac cct ggc age caa cca 3956 
His Leu Ser Pro Arg Pro Pro Pro Thr Gly Asp Pro Gly Ser Gin Pro 
1030 1035 1040 
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cag tgg ggg tct gga ,gtg ccc aga gca ggg aag cag ggc cag cac gcc 4004 
Gin Tip Gly Ser Gly Val Pro Arg Ala Gly Lys Gin Gly Gin His Ala 
1045 1050 1055 

gtg age gcc tac ctg get gat gcc cgc agg gcc ctg ggg tec gcg ggc 4052 
Val Ser Ala Tyr Leu Ala Asp Ala Arg Arg Ala Leu Gly Ser Ala Gly 
1060 1065 1070 1075 

tgt age caa etc ttg gca gcg ctg aca gcc tat aag caa gac gac gac 4100 
Cys Ser Gin Leu Leu Ala Ala Leu Thr Ala Tyr Lys Gin Asp Asp Asp 
1080 1085 1090 

etc gac aag gtg ctg get gtg ttg gcc gcc ctg acc act gca aag cca 4148 
Leu Asp Lys Val Leu Ala Val Leu Ala Ala Leu Thr Thr Ala Lys Pro 
1095 1100 1105 

gag gac ttc ccc ctg ctg cac agg ttc age atg ttt gtg cgt cca cac 4196 
Glu Asp Phe Pro Leu Leu His Arg Phe Ser Met Phe Val Arg Pro His 
1110 1115 1120 

cac aag cag cgc ttc tea cag acg tgc aca gac ctg acc ggc egg ccc * 4244 
His Lys Gin Arg Phe Ser Gin Thr Cys Thr Asp Leu Thr Gly Arg Pro . 
1125 1130 1135 

tac ccg ggc atg gag cca ccg gga ccc cag gag gag agg ctt gcc gtg 4292 
Tyr Pro Gly Met Glu Pro Pro Gly Pro Gin Glu Glu Arg Leu Ala Val 
1140 1145 1150 1155 

cct cct gtg ctt acc cac agg get ccc caa cca ggc ccc tea egg tec 4340 
Pro Pro Val Leu Thr His Arg Ala Pro Gin Pro Gly Pro Ser Arg Ser 
1160 1165 1170 

gag aag acc ggg aag acc cag age aag ate teg tec ttc ctt aga cag 4388 
Glu Lys Thr Gly Lys Thr Gin Ser Lys lie Ser Ser Phe Leu Arg Gin 
1175 1180 1185 

agg cca gca ggg act gtg ggg gcg ggc ggt gag gat gca ggt ccc age 4436 
Arg Pro Ala Gly Thr Val Gly Ala Gly Gly Glu Asp Ala Gly Pro Ser 
1190 1195 1200 

cag tec tea gga cct ccc cac ggg cct gca gca tct gag tgg ggc etc 4484 
Gin Ser Ser Gly Pro Pro His Gly Pro Ala Ala Ser Glu Trp Gly Leu 
1205 1210 1215 

tag gatgtgccca gcctgccaca ccgcctccag gaagcagagc gtcatgeagg 4537 



tcttctggcc agagccccag tgagtgccca cggaggcccc cagcacaccc aacgtggctt 4597 

gatcacctgc ctgtccagct ctggtgggcc aagaacccac ccaacagaat aggccagccc 4657 

atgccagccg gcttggcccg ctgcaggcct caggcaggcg gggcccatgg ttggtccctg 4717 

cggtgggacc ggatctgggc ctgcctctga gaagccctga gctaccttgg ggtctggggt 4777 

gggtttctgg gaaagtgctt ccccagaact tccctggctc ctggcctgtg agtggtgcca 4837 

caggggcacc ccagctgagc ccctcaccgg gaaggaggag acccccgtgg gcacgtgtcc 4897 

acttttaatc aggggacagg gctctctaat aaagctgctg gcagtgccc 4946 
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<210> SEQ ID NO: 2 

<211> LENGTH: 1219 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 



<400> SEQ ID NO: 2 



Met 


Pro 


Lys 


He 


Val 


Leu 


Asn Gly Val Thr Val 


Asp 


Phe 


Pro 


Phe 


oin 


1 








5 






10 








15 




Pro 


Tyr 


Lys 


Cys 


Gin 


Gin 


Glu Tyr Met Thr Lys 


Val 


Leu Glu Cys 


Leu 








20 








25 






30 






Gin 


Gin 


Lys 


Val 


Asn 


Gly 


He 


Leu Glu Ser Pro 


Thr Gly Thr Gly 


Lys 






35 










40 




45 








Thr 


Leu 


Cys 


Leu 


Leu 


Cys 


Thr Thr Leu Ala Trp 


Arg 


Glu 


His 


Leu 


Arg 




50 










55 




60 










Asp 


Gly 


He 


Ser 


Ala 


Arg 


Lys 


He Ala Glu Arg 


Ala Gin Gly Glu 


Leu 


65 










70 




75 










80 


Phe 


Pro 


Asp 


Arg 


Ala 


Leu 


Ser 


Ser Trp Gly Asn 


Ala 


Ala 


Ala 


Ala 


Ala 










85 






90 








95 




Gly 


Asp 


Pro 


He 


Ala 


Cys 


Tyr 


Thr Asp lie Pro 


Lys 


He 


lie 


Tyr 


Ala 








100 








105 






110 






Ser 


Arg 


Thr 


His 


Ser 


Gin 


Leu 


Thr Gin Val lie 


Asn 


Glu 


Leu 


Arg 


Asn 






115 










120 




125 








Thr 


Ser 


Tyr 


Arg 


Pro 


Lys 


Val 


Cys Val Leu Gly 


Ser 


Arg 


Glu 


Gin 


Leu 




130 










135 




140 










Cys 


He 


His 


Pro 


Glu 


Val 


Lys 


Lys Gin Glu Ser 


Asn 


His 


Leu 


Gin 


lie 


145 










150 




155 










160 


His 


Leu 


Cys 


Arg 


Lys 


Lys 


Val 


Ala Ser Arg Ser 


Cys 


His 


Phe 


Tyr 


Asn 










165 






170 








175 




Asn 


Val 


Glu 


Glu 


Lys 


Ser 


Leu 


Glu Gin Glu Leu 


Ala 


Ser 


Pro 


He 


Leu 








180 








185 






190 






Asp 


He 


Glu 


Asp 


Leu 


val 


Lys 


Ser Gly Ser Lys 


His 


Arg 


Val 


Cys 


Pro 






195 










200 




205 








Tyr 


Tyr 


Leu 


Ser 


Arg 


Asn 


Leu 


Lys Gin Gin Ala 


Asp 


lie 


lie 


Phe 


Met 




210 










215 




220 










Pro 


Tyr 


Asn 


Tyr 


Leu 


Leu 


Asp 


Ala Lys Ser Arg 


Arg 


Ala 


His 


Asn 


He 


225 










230 




235 










240 


Asp 


Leu 


Lys 


Gly 


Thr 


Val 


Val 


He Phe Asp Glu 


Ala 


His 


Asn 


Val 


Glu 










245 






250 








255 




Lys 


Met 


Cys 


Glu 


Glu 


Ser 


Ala 


Ser Phe Asp Leu 


Thr 


Pro 


His 


Asp 


Leu 








260 








265 






270 






Ala 


Ser 


Gly 


Leu 


Asp 


Val 


He Asp Gin Val Leu 


Glu 


Glu 


Gin 


Thr 


Lys 






275 










280 




285 








Ala 


Ala 


Gin 


Gin 


Gly 


Glu 


Pro 


His Pro Glu Phe 


Ser 


Ala 


Asp 


Ser 


Pro 




290 










295 




300 










Ser 


Pro 


Gly 


Leu 


Asn 


Met 


Glu 


Leu Glu Asp lie 


Ala 


Lys 


Leu 


Lys 


Met 


305 










310 




315 










320 


He 


Leu 


Leu 


Arg 


Leu 


Glu 


Gly Ala He Asp Ala 


Val 


Glu 


Leu 


Pro 


Gly 










325 






330 








335 




Asp 


Asp 


Ser 


Gly 


Val 


Thr 


Lys 


Pro Gly Ser Tyr 


lie 


Phe 


Glu 


Leu 


Phe 








340 








345 






350 






Ala 


Glu 


Ala 


Gin 


He 


Thr 


Phe Gin Thr Lys Gly 


Cys 


He 


Leu Asp 


Ser 






355 










360 




365 








Leu 


Asp 


Gin 


lie 


He 


Gin 


His Leu Ala Gly Arg 


Ala Gly Val 


Phe 


Thr 




370 










375 




380 










Asn 


Thr 


Ala 


Gly 


Leu 


Gin 


Lys Leu Ala Asp He 


He 


Gin 


lie 


Val 


Phe 


385 










390 




395 










400 


Ser 


Val 


Asp 


Pro 


Ser 


Glu 


Gly Ser Pro Gly Ser 


Pro Ala Gly Leu 


Gly 










405 






410 








415 




Ala 


Leu 


Gin 


Ser 


Tyr 


Lys 


Val 


His lie His Pro 


Asp Ala Gly His 


Arg 








420 








425 






430 
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Arg 


Thr 


Ala 


Gin 


Arg 


Ser 


Asp 


Ala 

A. .La 






/IOC 












Arg 


Gly 


Lys 


vai 


Leu 


Ser 


Tyr 


Trp 




A C f\ 

4dU 










f± D J 




His 


Glu 


Leu 


vai 


Arg 


pi t* 
Gin 


Giy 


vai 


465 










4 / U 






Thr 


Leu 


Ala 


Pro 


17a 1 

vai 


Ser 


ber 


fne 










A Q C 
4oD 








Pro 


Val 


Cys 


Leu 


GlU 


Asn 


Pro 


IT " ~ 

HIS 








DUU 










Val 


Gly 


Val 


Val 


Pro 


Arg 


Gly 


Pro 






515 










ton 


Phe 


Asp 


Arg 


Arg 


Phe 


Ser 


Glu 


Glu 




530 










ci c 

535 




Leu 


Gly 


Asn 


lie 


Ala 


Arg 


Val 


Val 


545 










C C ft 






Pro 


Ser 


Tyr 


Pro 


Val 


Met 


Glu 


Lys 










565 








Asp 


Leu 


Ala 


Arg 


Lys 


Met 


Glu 


TV T _ 

Ala 








580 










Arg 


Ser 


Lys 


Gly 


Ser 


Phe 


Ser 


Glu 






595 










C ft A 

600 


Val 


Ala 


Ala 


Pro 


Gly 


Ser 


Thr 


Gly 




610 










615 




Gly 


Lys 


Ala 


Ser 


Glu 


Gly 


Leu 


Asp 


625 










C "5 ft 
O JO 






Val 


lie 


Val 


Thr 


Gly 


Leu 


Pro 


Tyr 










645 








Val 


Leu 


Lys 


Met 


Gin 


Phe 


Leu 


Asp 








660 










Gly 


Gly 


Gin 


Phe 


Leu 


Ser 


Gly 


Gin 






675 










680 


Arg 


Ala 


Val 


Asn 


Gin 


Ala 


lie 


Gly 




690 










695 




Tyr 


Gly 


Ala 


Val 


Phe 


Leu 


Cys 


Asp 


705 










710 






Arg 


Ala 


Gin 


Leu 


Pro 


Ser 


Trp 


Val 










725 








Asn 


Phe 


Gly 


His 


Val 


lie 


Arg 


Asp 








1 A A 

740 










Glu 


Arg 


Thr 


Met 


Pro 


Ala 


Pro 


TV ~l ~ 

Ala 






ICC 

755 










i c n 
/ bU 


Arg 


Gly 


Glu 


Asp 


Ala 


vai 


Ser 


GlU 




n a 
/ /0 














Ser 


Thr 


Arg 


Lys 


Ala 


Lys 


Ser 


Leu 


*7 O C 
/OD 










Ton 






Gin 


Arg 


Ser 


Ser 


Giy 


Ser 


Pro 


ax a 










one 
oU-5 








Cys 


Val 


GlU 


Tyr 


GlU 


pi -n 

Gin 


pi ii 
GlU 


Pro 








O £t \J 










Gly 


Leu 


Leu 


Ala 


Ala 


Leu 


Glu 


His 






835 










840 


Gly 


Glu 


Glu 


Gin 


Ala 


His 


Ser 


Cys 




850 










855 




Lys 


Arg 


Pro 


Ala 


Glu 


Glu 


Pro 


Arg 


865 










870 






Val 


Ser 


His 


Pro 


Glu 


Glu 


Pro 


Val 










885 








Lys 


Leu 


Phe 


Met 


Val 


Ala 


Val 


Lys 



900 



Trp 


Ser 


inr 


i nr 


riXa 


nld 


Arg 


Lys 










41 J 








Cys 


fne 


Coy 

bet 


Pro 


biy 


nis 


OCX 


nc L 








SOU 










Arg 


Ser 


Leu 


lie 


Leu 


i nr 


Cor 


PI \j- 






*i / D 










Aft 0, 


ill a 

Aia 


Leu 


Pi ii 
GlU 




pi — 
Gin 


Ho 

j. ie 


Pro 


fne 


















Tl — 

lie 


lie 


Asp 


Lys 


MIS 


Pi Y\ 

Gin 


Tl 0 

lie 


Trp 


c n. e 










D JL U 






Asp 


Giy 


Ala 


pi _ 
Gin 


Leu 


ber 


Ser 


Aia 


















Cys 


Leu 


Ser 


Ser 


Leu 


Gly 


Lys 


7\ 1 _ 

Ala 








C A ft 

b40 










Pro 


Tyr 


Gly 


Leu 


Leu 


Tl- 
IlS 


pne 


pne 






ccc 

555 










DbU 


Ser 


Leu 


Glu 


Pne 


Trp 


Arg 


Ala 


Arg 




570 










C 1 c 
J / 0 




Leu 


Lys 


Pro 


Leu 


Phe 


Val 


Glu 


Pro 


585 










C Q ft 






Thr 


lie 


Ser 


■jv 1 _ 

Ala 


Tyr 


Tyr 


Ala 


Arg 










C ft c 

dOd 








Ala 


Thr 


Pne 


Leu 


Ala 


vai 


Cys 


Arg 








DZ U 










Phe 


Ser 


Asp 


Thr 


Asn 


Gly 


Arg 


pi,, 
Gly 






C 1 C 

635 










a a ft 
o4U 


Pro 


Pro 


Arg 


Met 


Asp 


Pro 


Arg 


vai 














boo 




Glu 


Met 


Lys 


Gly 


pi — 
Gin 


Gly 


Gly 


Ala 


c c c 

665 










C 1 ft 

D/0 






Glu 


Trp 


Tyr 


Arg 


pi « 
Gin 


pi — . 

Gin 


Ala 


Ser 










c o c 








Arg 


Val 


lie 


Arg 


His 


Arg 


Gin 


Asp 








700 










His 


Arg 


Pne 


7v n _ 

Ala 


pne 


Ala 


Asp 


Ala 






715 










n n 
f Z\J 


Arg 


Pro 


His 


Val 


Arg 


Val 


Tyr 


Asp 




730 










/JO 




Val 


» 1 _, 

Ala 


pi « 
Gin 


pne 


Pne 


Arg 


vai 


aia 

Aia 


*7 A C 










/ DU 






Pro 


Arg 


Ala 


Thr 


Aia 


Pro 


Ser 


T7a 1 

vai 










/OD 








Ala 


Lys 


Ser 


Pro 


Giy 


Pro 


rfle 


rne 








Ton 
/ OU 










Asp 


Leu 


14 i c 
HIS 


Val 


Pro 


Ser 


Leu 


Lys 






/ y «? 










800 


Ala 


Pi \r 

Giy 


Asp 


Pro 


pi ii 

oil! 


Ser 


Ser 


Leu 














OIJ 




Vai 


Pro 


Ala 


Arg 


Pi n 

uin 


Arg 


Pro 


Arg 


one 
OZ j 
















Ser 


p i 
GlU 


pi_ 

Gin 


Arg 


a1=» 

Aia 


pi,. 

Giy 


oer 


Pro 










845 








Ser 


Thr 


Leu 


Ser 


Leu 


Leu 


Ser 


Glu 








860 










Gly 


Gly 


Arg 


Lys 


Lys 


He 


Arg 


Leu 






875 










880 


Ala 


Gly 


Ala 


Gin 


Thr 


Asp 


Arg 


Ala 




890 










895 




Gin 


Glu 


Leu 


Ser 


Gin 


Ala 


Asn 


Phe 


905 










910 
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Aid. Thr Phe 


inr 


pi n 
tj in 


Ala 
Ala 


Leu 


pin A en 


Tyr 


Lys 


pi v 


Cof Aon 


Asp 


Phe 












J £» \l 








925 






Aid Ala Leu 


J\±a 


Ala 
Ala 


Cys 


Leu 




Leu 




AT a 
t\xa 


P "1 ii A cn 


Pro 












j j j 








940 








JjyS HIS ASn 


Leu 


Leu 


pi n 

Vjin 


Gly 


Phe Tyr 


Gin 




Val 


Arg Pro 


His 


His 
















955 








960 


IjyS Li in L»xn 


yne 


L»1U 


Glu 


Val 


Cys lie 


p i n 
win 


Leu 


Thr 


Gly Arg 


pi v 


Cys 














J / V 








975 




Gly Tyr Arg 


Pro 


Glu 


His 


Ser 


lie Pro 


Arg 


Arg 


Gin 


Arg Ala 


Gin 


Pro 












985 








990 






Val Leu Asp 


Pro 


Thr 


Gly 


Arg 


Thr Ala 


Pro 


Asp 


Pro 


Lys Leu 


Thr 


Val 


y y o 










1000 








1005 






Ser Thr Ala 


Ala 


Ala 


Gin 


Gin 


Leu Asp 


Pro 


Gin 


Glu 


His Leu 


Asn 


Gin 


1010 








1015 






1020 






Gly Arg Pro 


His 


Leu 


Ser 


Pro Arg Pro 


Pro 


Pro 


Thr Gly Asp 


Pro 


Gly 


1025 






1030 






1035 






104 


Ser Gin Pro 


Gin 


Trp 


Gly Ser Gly Val 


Pro 


Arg Ala Gly Lys 


Gin 


Gly 






1045 






1050 






1055 


Gin His Ala 


Val 


Ser 


Ala 


Tyr 


Leu Ala 


Asp 


Ala 


Arg 


Arg Ala 


Leu Gly 




1060 






1065 






1070 




Ser Ala Gly 


Cys 


Ser 


Gin 


Leu 


Leu Ala 


Ala 


Leu 


Thr 


Ala Tyr 


Lys 


Gin 


1075 








1080 








1085 






Asp Asp Asp 


Leu Asp 


Lys 


Val 


Leu Ala 


Val 


Leu 


Ala 


Ala Leu 


Thr 


Thr 


1090 








1095 






1100 






Ala Lys Pro 


Glu 


Asp 


Phe 


Pro 


Leu Leu 


His 


Arg 


Phe 


Ser Met 


Phe 


Val 


1105 






1110 






1115 






112 


Arg Pro His 


His 


Lys 


Gin 


Arg 


Phe Ser 


Gin 


Thr 


Cys 


Thr Asp 


Leu 


Thr 






1125 






1130 






1135 


Gly Arg Pro 


Tyr 


Pro Gly Met Glu Pro 


Pro Gly Pro Gin Glu Glu Arg 




1140 






1145 






1150 




Leu Ala Val 


Pro 


Pro 


Val 


Leu 


Thr His 


Arg 


Ala 


Pro 


Gin Pro Gly 


Pro 


1155 








1160 








1165 






Ser Arg Ser Glu Lys Thr Gly Lys Thr Gin 


Ser 


Lys 


He Ser 


Ser 


Phe 


1170 








1175 






1180 






Leu Arg Gin 


Arg 


Pro Ala Gly Thr Val Gly Ala Gly Gly Glu Asp Ala 


1185 






1190 






1195 






120 


Gly Pro Ser Gin Ser Ser Gly Pro Pro 


His 


Gly 


Pro 


Ala Ala 


Ser 


Glu 



1205 1210 1215 



Trp Gly Leu 



<210> SEQ ID NO: 3 

<211> LENGTH: 114793 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<400> SEQ ID NO: 3 

tgaagagctt tgaccaagag gctgtgacga ggccctacga ggactctggc tctcctcctg 60 

ctaagcacac ccaggcaggt gtcctggcag atgaggacca catgcagagc ctcggccagc 120 

ccaccaatgc ccggatatgc aagtgagccc agcctggacc ccccggcgag gcccagcagc 180 

accagcccag gcccgaaaac cttaagaaat gaccagtgtc tgctgcttta agccaccaag 240 

ctctgcggtg gtttgttagg ctgcaagcat ggctaattca gaaactgcca gaaacaagca 300 

ctgctgtccc cagcctggga cacacagcac cgcctctgcg tggggagagg gcacaggcta 360 

agggcacaaa tgccatccca gacccggctc ttgtgtgtgg aaggggccac tgtgccatga 420 

ggcagaggaa accttggcag gaccttatgc cacagcaatt taaaagagaa gaaacaggct 480 

gggcgtggtg gctcatgcct ataatcccag cactttggga ggccaaggtg gtggatcact 540 

tgaggtcagg agttcaagac cagcctggcc aatatggtga aaccctgtct ctacgaaaaa 600 

tacaaaattt aggcaggcgt ggtggcgggt gcctgtaatc cctgctattc aggaggctga 660 

ggcaagagat ttacttgaac ccaggaggtg gaggctgctg cagtgagctg agatcatgcc 720 

actgcactcc agcctgtgtg acggagtgag acttggtctc aaaaaaaaaa aaggaaacac 780 
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atctgactag 
tccaaacaag 
agcctgtaac 
caagactagc 
cagtcccagc 
agtcccagct 
gccatgagcc 
cgaaaaacaa 
gtaaaaaatg 
tttaaaaaag 
tcacacctta 
atcagtgagg 
ccgcacgctc 
gacgtgccat 
gctgggaagg 
agaaatgatc 
accagcactt 
ctttcctctg 
acagaccggg 
tccctccaga 
agctgacgac 
gaattaaaaa 
tctttctttt 
gcagtggcgc 
cctcagcctc 
caaaagaaaa 
ctctacttct 
taacatccta 
ccaccttcca 
actgcgccag 
caagacaccc 
ttttattact 
tgtcacccag 
gttcaagcaa 
tggctaattt 
aaattcctga 
agccactgcg 
cacatttcaa 
ctcttgggtg 
agagccgggc 
tgggcaacac 
aggcaagaga 
ttatactaga 
agcaattact 
ctaggaaatc 
gtgagaactg 
gagggctcct 
ccgtggtcta 
aagagagtct 
gagggcctga 
gatgctgtgc 
gctctgcctt 
gaaataatgg 
aagggaaggg 
atggttcaca 
attcggggcc 
tttttctttt 
gagaaaaccc 
caggcacgac 
ggacatctgg 
catgctaatg 



tgtgatctcg 
gaatccccag 
cccagtgctt 
ttgggcaaca 
actgtactaa 
actcaggagg 
gtgactgtgc 
aaaatcattt 
gttctgaccc 
agatggtctc 
gaaagctttc 
tgccttcgaa 
agtaggggag 
aaccaagaag 
cccaggaaaa 
tcaacgtttt 
ggtgatgtcg 
caactgtggg 
caaagcagag 
gacccctgtg 
aggggttggt 
taataacaac 
tctttttttt 
aatcttggct 
ctgagtaact 
ctgacacaat 
cactgttctt 
gaaaaaaagc 
cagtgaccac 
ccgtgcgcct 
acagaggaga 
acaacatttt 
gctggagtgc 
ttctcctgct 
ttgtactttt 
caagtgatcc 
cctggcccat 
atgggtaact 
tgagagtcta 
agggtaaaga 
ccactccatc 
ccgagggcac 
tatagatcat 
ctttattcca 
ataacctagg 
accagaagac 
tggtctagcg 
gcggtagcct 
ccctttcccc 
catcagtcag 
ttcagtggtc 
ctagatagca 
cgtaagcggt 
ccccctggcc 
ctccttaccc 
gctaccagtc 
atctttgtct 
accaaccctg 
tcagctgctg 
cccagtctcc 
ggcaatatga 



caaggaacat 
gggatcaaat 
cgggaggctg 
cagtgagagc 
aaatctacac 
ctgaggcagg 
cactgcactc 
catgtctcca 
acagaaatct 
agaataaaga 
tggccgcatc 
caagccacct 
gggaagtcgc 
ccccagccac 
ggcgggaggc 
gtaacttcct 
cttgcgtcaa 
cttacggggc 
gagagccacc 
gccaaggcca 
gaggccggcc 
tatgtcttgt 
tttcggagac 
cactgcaagc 
gggactacag 
cataaacaga 
caaagagtta 
tcctactcag 
aggaaaccaa 
gtgtctgcca 
gctctaagcc 
tgttttcttt 
agtggcacaa 
gcagcctccc 
agtagagatg 
acccaccctg 
ttttgtttat 
ccagtgtcct 
gcatgtgggt 
gaccccctag 
agatggaccc 
gagctgttcc 
aaatatgatt 
atattataat 
aaaaactagg 
aagagtgcga 
gtaacgccag 
cagtgtcaag 
gggggagttt 
gcccgcccgc 
acgctcctag 
gcagcaaatt 
ctctctctct 
agtgggcacg 
tgccgctttg 
tccgcgtctt 
tgtgtcttta 
tggggctggt 
tagtgaattg 
tcgttgagcc 
ggactaaatg 



tccagacaca 
tgttttgcat 
aggtgggagg 
ccattagcca 
ggggccgggc 
acgattcctt 
cagtctgtgc 
gtttctccac 
aagaaaggaa 
ccaacctggg 
tggccaaagg 
gccctggagc 
taggttttag 
acccagaccc 
ggacgaatgg 
accaagaggc 
tcaaggcaac 
aaagaagtcc 
cagccgagcc 
cccagaccag 
cagaccagca 
cttaagccac 
gcagtctcac 
tctgcccccc 
gtgcctgcca 
gcaagcaaga 
actcaagcat 
tgttcataaa 
gacagcaagt 
ctgccctggt 
acaactgtgt 
tttttttttt 
tctcggctca 
aactggatta 
gggtttcacc 
gcctcccaaa 
caataaaaat 
tgatgctcct 
ggtctggaca 
gatgtgaagg 
tggtatgggc 
ggtgtaataa 
atatatgaat 
aatccttgcc 
ccatacagag 
gccttctgtt 
catctgggaa 
gaaaaacacc 
agagaagact 
agttatccgg 
tccgccttca 
agtgaaagta 
ctctcctctc 
tgacccacat 
tcttgtatcc 
ggtggtagtg 
tttctacact 
ccctacaccc 
cgatcctcca 
agttcacgaa 
ggatggtctc 



gtggagctag 
cggccagaca 
actgcttgag 
ggcgtggtgg 
atggtggcac 
gaacccagga 
aacagaacga 
tggcaaaaga 
aaaatataaa 
ctatggttgt 
gccaccctgc 
ccgtcctgtc 
ttcaccagtc 
gatgtggcca 
aaatgtcatt 
agtcttagct 
agaagtgagc 
aggcctccag 
tacctgtgcc 
caggtccttg 
gaaccacgaa 
taagttttgg 
tctgttgccc 
ggattcacgc 
ttgggtgttt 
gaacttggca 
aagatgtgag 
gcaaagctaa 
gggacaccag 
ccgtcactgc 
acgaagacaa 
tttgagactg 
ctgtaacctc 
caggcgcccg 
atgttggcca 
gtgctgggat 
gtacttaatg 
gcgacatgtt 
ggagggggag 
ccgccctgca 
ggcaagccac 
aatgcataaa 
atcattcatc 
taagcataac 
ataggagctg 
atgcctggac 
gacgcccgtt 
cgctacttag 
ctactcctcc 
aggcctaacc 
tgttccatcc 
ctgaaagtct 
tctctgcctc 
gaccttacct 
aataaatagc 
gtcccccagg 
ctctcatctc 
tggctttgta 
aacccagcaa 
aaagagactt 
caacttggac 



aaggttcttc 
tggtggctca 
tccaggagtt 
cacatgcctg 
atgcctgtag 
ggtcacggct 
gactctgttt 
ctctgtcaag 
aaatagaaaa 
cactcttccc 
cccatcttgg 
ttgtctgcca 
tctggatcaa 
caaggggtga 
ctgtggccac 
ctgcccttga 
aggaggccca 
gtggaggatc 
tcagactgcc 
ccaagctgtc 
ccaaccaaca 
atggtttctt 
aggctggagt 
cattcccctg 
tcttaaacag 
attatttcct 
caaattcttt 
cctacaggag 
cctccagggc 
caccagccgg 
ctgtgcagga 
agtctcgctc 
catctccctg 
ccaccacgcc 
gactggtctc 
tacaggtgtg 
ttgaactctc 
cgtgagactt 
ggaagagtgc 
tttgtcagac 
ctaggtgccg 
ataagaatag 
attagtttgt 
ctaggaaaaa 
aggggacata 
agggccacca 
gccaagtgga 
caaaccagga 
acctcttgcg 
gtctccctgt 
tgtgcacctg 
ctgataagca 
agctgccagg 
atcactggac 
gcaacctggc 
cccagctgtc 
cgcatacgag 
gactggagcc 
ggcacctgca 
ttctgagtga 
aaaccaacag 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
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taaaagccac tttgcgggga aagaaacttt tccttttttc ttttttttga gacaggatct 4500 

caccctgtca cccaggctgc agtgcagtgg catgaccttg gctcactgca gcctcaacct 4560 

ctctcaggct caagcaatcc tcccgcctca acctcccatg cagctgggac cataggtgca 4620 

tgccaccaca cccaaataat ttttatattt tttgtagaga cgaggtttca ctatgttgct 4680 

cgggctggtc tcaactcctg ggctcaagca accctcccac ctcagcctcc caaagtgctc 4740 

agattacagg caggagccac caggcctggc caacatagga agaaatttaa atttgaattg 4800 

aatattagaa gagatgaaaa ttcatcaaca tggaaagaca aagatcatta actaaagcca 4860 

aaccagaatg gaagctgtgt gtacagtggg gtctcatgct gggaacgcga ggggcacgtg 4920 

cagggctcca cggtgtggcg acgccccatg ctccctttgt gggggttcat ccagcggaac 4980 

atgaggacct ggggtgcttt tcaacatgta cgtgagttta ataataaaaa ggtttaagga 5040 

aagaaaaatt catatgtttc tatataaaca gaacatctgg aaagatctat tctaaggtgt 5100 

tgacagtagg aatctctagg tagtagtaat atggcctttt tgaatttttg cttatcagta 5160 

ttttctaatt ttctttttct ttctaaataa ttctagctat gaaataattt tctaccatat 5220 

atattttgta ataaaaatgg ttatatttaa ttttttaaag gctgtacaaa cttcctgata 5280 

aaatggcaaa ttagacacac acatgtgggc cgggtacagt ggctcgcgcc tgtaattcca 5340 

gcactttggg aggctgaggc aggcagatca cctaaggtca ggagtttgag accagcctgg 5400 

ccaacatggt gaaaccccgt ctctactaaa tatacaaaaa tgagctggat gtggtggcac 5460 

acacctatag tgccagctac ttgggaagct gaggcaggaa aattgcttca acccgggagg 5520 

cagaggttgt agtgagccga gatcatgcca ctgcactcca gcctaggcaa caagagcgag 5580 

actccaactc aaaaaaaaat aaaaataaca cacacgtgaa taggctcctc atggaagtca 5640 

tcacaacaat gcagagggaa gagcttccaa agtgtaaacc cagaagcgag gagcaggagg 5700 

gtgcgcgcag acgcagagag cagcaaggtg cagactgaga ggcggaggct ggccgtgggg 5760 

agatgactga tgctcagttt ataccccaaa tccgtaaatc tagaggcctg gcacatcaac 5820 

tacctctgcc agcaggaatg agggaaagga gggcaaccaa aagatgtccc accctcaccc 5880 

atccagctac ctgccatcct cagccccact ggcagaagac cctgagaggt ggaggcaggc 5940 

ccctgcctac aggaccctga gagctagggg aaggcgttat cctgaactgt gtcccccgta 6000 

aaattcatat gttgaaggcc tcatccccag tgtgactgta tttaaagatg gggtcttcag 6060 

gagataattt aaatgaggtc atataagttg gccctcatcc agtaagactt tgaccttctg 6120 

gtggtttttt tttttttgga gactgggtct cactctatca ctcaggttgg agtacagtgg 6180 

cacgatcacg gctcactgct gtctccaact cctgggctca ggtgatcctc ctgcttcagc 6240 

ctcctgagta gctgggacta caggtgctta ccaccgcacc cagctggtgg tgcattgtgt 6300 

tttttgtaga gatggggttt tgccatgtcg cccaggctgg tcctgaactg ggctcaagtg 6360 

atctgtctcc ctcggcctcc tgcagtgctg gaattacagg tatgagccac cgcgcctggc 6420 

cgaccgtgac cttctaagaa gtgaaagaga aagatctttc tctctccctc cctctccatc 6480 

atgaggacac agcaagaagt cggccatctg caaggtagaa agcgagtcct cccaacagct 6540 

gaacctggca gaccctgatc ttggacttca gccttcagag ctgtaagaaa ataactctct 6600 

gctgttcagg ccacgcggtc tacggcagcc cgagcagact aagacacacg ccatctgggg 6660 

agtcagacca gatcaggaag aaaggcctag agctcaggat actgaaggtc ccaacccggt 6720 

gctggaccag accaccccgg cagccgcggc cacggagtca cggctcgggt gaggtgacct 6780 

ggacaccatc ccggcagccg cggccacgga gtcacggctc gggtgaggtg acctggacac 6840 

catcccggca gccgcggcca cggtgtcacg gctcggatga gatgactcgg acaccacccc 6900 

ggcagccgcg gccacggtgt cagggctcag gtgaggagag ttggatatgg gactgggcct 6960 

accccgaggc tgcttccacc cagacgcctg ggtgggtgac acgaaagctg ggctcagttg 7020 

ggatcagagc agcctctccc caggtcagaa atgaccctgg gctcctcaca gtagccctag 7080 

ggcaccatga gaaagctacg tggacttctc tgaccaaggg tcactgctgc cacactactc 7140 

attgcaggcc atgtcagggc tcagctgagg agacgtggac accaccccag cagccgcggc 7200 

cacggcgtcc caagggaggg acttgggcac tgcctctctg ggcaagagtg gggaggtgtg 7260 

gggtgggaga tgtctggaaa catcatggac acatgccggg aaaacacgga agctgtgcac 7320 

caaggtgctg acaaaggaaa aaggagaatg gaggtgtgaa catccagcta gcaggtccca 7380 

ctcagaaact cctgcatttc cagacatggc caccagctct gtggatgaga caggggagga 7440 

cagggtacct cacaccagga acccacacag gtccatgtct tgctctgtga tcacacaaca 7500 

gcctccacca ccctgacatg caggagggag gtcaaagcct cgggtccaac aacaggctcc 7560 

acagcaaggg aagaaaggca ggaaggaact cagggccagg tcctcccagg cagcagctgc 7620 

ctgcacgctg tccaccaagg gaggtctgac ctacaccgca caggggttgg cagtctagag 7680 

tcgtcctctg tcaaacggtg agaaagtcaa aagctcatgc tcagtgatat gctaggtcag 7740 

catgaagatg ccacacatga gacacagcaa ggatgagacc aacgggaaga ctgccccaga 7800 

ccagagcccc agagccctct ggggaggaag aataaggatg gcagcctggg actgcccggg 7860 

gctgactctg cctttatttc accccagcag aggcaggagt gacaccggct cacagcagga 7920 

gcagctctgc cacctcctag cagttccacc tacgggcagc aaaacaaagc tggcagtttg 7980 

ggcaaatgtt agcgtttttg ccaactaaca tttgaatcgg acatctggta cagagatgag 8040 

gaagaaaaca ctcacagttt catgaagact gtcaagaaaa tcactgactc ttcacttcat 8100 
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ttatgaaagg ccagctctct gacatcccta ccactccctc tcacatgaga aatcacggcc 8160 

tttcaggacg tggagccacg tggccatgca ggtacgggag gcctccccgc agctgcagct 8220 

gggtcttctg gtccccgtgc catttctgct tttcttcgct ctctacttac acacacattt 8280 

gagtccagtc tcagaagaac tggaactaga aaaatcctga cacttgtccc ttactacgtt 8340 

aatgccagct gtgccaagga cagcccaacc caagccccca tcagccccaa tggcaccgag 8400 

gcccgagctt acccgtgagg ggccaagttg gtcgtcacca acacggtctt caccccctcc 8460 

acaccactgc cgtccactgc agtgtccgga gttgtcacaa ccaccacctc ctccatgtgc 8520 

acactcacgt cgggagtcgc catggctcag cggaagggga cgcccaggcc agcagcgtca 8580 

gtcctccagg gtcccaagtc ctggaggaag caaggcaggg cacagggatg gagtcatctc 8640 

cacatccaca caacatagca ctcacaaagg catctctaat cagctccaaa gacccaccct 8700 

tgagtcccag actgctacct cctgacaaaa acgagcggca acagaagggc tactccaggc 8760 

tctggttccg agggcggtgt aagcgcactc cacccgtttt tcccactgga taagccgaaa 8820 

cccttgggta gaaagcacag agccactccc tccacgtggg gctcagagca ggaggacagg 8880 

aggggcctgg aattccaagc aacttccctg gacgcaggct cccggcttgc cagttcttcc 8940 

gtctctcctg gcctgaactc aaagccagcc ccaatccctg aactgagttt caggtgcaga 9000 

aagcactcca agaagtcctc gctggtctgt ggaacgggaa gggaaaccca ttcaagacag 9060 

aaagagagga gggaaacgcc ctgggttttt ttgggttttt gggttttttt tgagacggag 9120 

tctcgctctg tcgcccaggc tggaatgcag tggcacgacc tcggctcact gcaagctcca 9180 

cctcctgggt tcaagtgatt ctcctgcctc agcctctcca attgctggga ttacaggttt 9240 

caccatgttg cccaggctgg tctcaaactc ctgacctcag gtgatccact cacctcggcc 9300 

tcccaaagtg ctgggattgc aggtgtgagg caccatgcct ggcctgcccc gggtttaaaa 9360 

attattatta ttttgtcttt cctggctttg ccttcagcaa gtccaacccc tgctaaaacc 9420 

cggtgataat ggctgtcctg gcccaaaaag cttggagaca ggggaatctt cctcctgact 9480 

aaaggaatgg tggcccaaga gtgtgggggc tccctgttgc cctctcactc tccatcccct 9540 

acctagcaca gggaacacaa aagcccctgg tttccagcca gagggcaacg agcctggagt 9600 

cagagtgtgg gggaggcgac aagaggagag gggagaagag aggatggcac acagctgtgt 9660 

gtgagcgcct gggtcgtccc aagacagtct ctacgtggtc ctgaccctaa agggcaaagg 9720 

gaagaaaact gacctacagg ataggccact gcccaggtct cagatgggcc ccagtggcgc 9780 

atatgggaca gatccacagt gcactggaaa gtctctaaaa taaactggcc taagaacaca 9840 

gacacaggaa cggggtgcaa aatttgcagc ctgaacctaa ccaggtcgat ttcttgctat 9900 

gaaaaaaaaa agtctacatt ctctgtgaaa cttaaaacaa gacctagagt ccatagcaca 9960 

gtagtcaaag catccagaac acgatcaaac ttcctggcaa agggtagtct ggttgattct 10020 

caaaggaaca aatacacaag agaagctggc tcttgaacgc agaatccaga gactttcagg 10080 

tgctatcgga ccagctccaa gaggaaagca aacattgtca accaagtgga aagaaaatct 10140 

tggtatagaa acaggagtta taaccaaaca gaaatgtgaa aattaaaaac gacaaccaaa 10200 

agaaaataca caaagctggg atagtctcag ctactcggaa ggcggggctg gaggatcgtt 10260 

tgagcctagg agattgaggc tgcaatgagc tgtgatcaca ccaccgcact ccagtctggg 10320 

caacagagtg agaactctct caaaaaacga aaaagaaaga aagtagaaca gaagtgacca 10380 

ggggctgggg gagggagtac agggagttgt tctttaatga gtacagaatt tctgtttggg 10440 

atgatgaaaa gctctggaaa tggacggcgg tgatggctgc acaatcactg tggctgttct 10500 

gaatggtgct gaaccacaca tttaaaaaca gttaaaatgg gctgggcgtg gtggctcacg 10560 

cctgtaatcc cagcactttg ggaggcggat cgcctgaggt caggagttcg agaccatcct 10620 

ggccaacaca gtgaaatcct gtcttgacta aaaatactaa aaattagcca ggcatggtgg 10680 

caggcacctg tagtcccagc tacttgggag gctggggcag gagacctgct tgaacccagg 10740 

aggcagaggt tgcagtgagc cgagatcgtg ccactgcact ccagcctggg caacaagagc 10800 

gaaactccat ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtttaaaa tggttaaatt 10860 

ttatgttatg tatattttac cgtaataaaa acactgtaat gctactataa tagaatgact 10920 

cattaggatt agatatagac tagaaagtac agaatataaa aactttttaa acaaagaaaa 10980 

attttcatgg ccaggcatgg tgtcacacct gtaatcccag gactttggga ggccaaggca 11040 

agaggaatgc ttgagctcag gggtttgaga ccagcctggg caacacagca acaccccatc 11100 

tctgctaaat aaataataaa aaatagccag gcatggtggt gtgcacgcct gtagttgcag 11160 

ctactctgga ggctgaggca ggaggatcac ttaagcccag gaggtcaagg ctgcagtgag 11220 

ccatggttgt gccactgcgc tccagcctgg gcaacagatc aagaccttgt cacaaaaaaa 11280 

agaaagaaag aaaagaaaaa agaaagaaaa taaaatcttc cagaactttt aaaatcatca 11340 

ttgttaatat aaaaataaca tcacctgccc ctaggactgt aacaaacaag tgtgtctaag 11400 

gacaggagtg ggtccacccc aacctggcac gcagtggtcc cctgcggaga gtctggccct 11460 

gcactcacta agaggaggca ctcatagccc agccaggcct ctgcaattat gccttcaatg 11520 

ccagaactaa ctcacccaaa ctgaacaatc gatcacaaaa tgtgccttca ggtctcaagg 11580 

ttcttgctaa atcttactca accgacattt tccagcatgg gaacattttt ctgaatgtct 11640 

tagggagagg aagtccgcaa gagaacaaaa ggtcctcagg ccaccctagc ttcttttcct 11700 

ccattccaca ggctgtcttt tgtctgggta tgcactggac cagggggctc tacttcttcc 11760 
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tacctgggca tgggtctcca cacaactcca aggtaaaggg ccacaggcaa gataaagggg 11820 

agaaaagaaa gctacgattt cctgggccac caatcgcaaa tggcagccag tctctgaagt 11880 

aacccttgac cagagatcca aggaaccaag aaatgtaggt gatctgaaca gaggggatgg 11940 

tggttaaaca ccatgaagga aagacccatt ctcaaagaaa aggaagcaaa aagaaaccgt 12000 

ggggagctgg gtaccacccg cagcaaagac cccgcacgcg ttactgacgc cagcctggcc 12060 

tgggagagca gtgagtgtgg cggacggtga gtggcgggga gggctgtggt aggtttaggg 12120 

taagaagggg cagcgcccag agcccagaga acaccagtga gggctccaca ggaacactac 12180 

tcaaagtatt cacggaacac atctaaacac aagcactaag gactaagtgc gagggacaag 12240 

aaaatattcc ccgtttcctg tttcaggagg gtatcgaaaa tgagtgatgg aaggaaaatg 12300 

tattgtttaa atgaggaaaa aaaattttta caaattaaga acatcctgga acatgatgag 12360 

ccgtttactg tcactcaatt taaatggtgg ccatctagga cagagcgcct aaggggaaag 12420 

ggggctcaca ggtgaacccc tccagctgct ggtgggcaat ttcccattag ggcatcaggg 12480 

tctctgaaga ctgtcttcag atgcttttta gccaggaaag ttacaatgat gaattcgttt 12540 

acactggcgg aattacttcg tatttctcaa atataatgtt ttcactagca taactttgtt 12600 

gttgtagact taggcttcaa aataaagaac tttaaacaaa catgaataaa aagccacttt 12660 

aggccgggcg cggtggctca cacttgtaat cccagcactt tgggaggccg cggcgggtgg 12720 

atcataaggt cagaagttca aagaccagcc tgatcaatac ggtgaaaccc cgtctctact 12780 

aaaaatacaa aaattagccg ggcgcggtgg caggtgcctg taatctcagc tacttgggag 12840 

gctgaggcag gagaatcgct tgaacctggg cagcagaggt tgcagtgagc caagatcatg 12900 

ccactgcact caagcctggg tgacagagtg agactctctc ttaaaaaaaa aaagccactt 12960 

taaaatttta ctcaggccag gtgtggtggc tcacgcccat aatcctagca ctttgggagg 13020 

ccgaggcgag cagatcacct gaggtcagga gttagaccag cctggccaac atggtaaaac 13080 

cttgtctcta ctgaaaacac aaaaattagc tgggcgtggt ggtgtgccca tgtaatccca 13140 

gctactcagg aggctgaagt gagagaactg cttgaacccg ggaggcagag gctgcagtgt 13200 

gccaagactg caccactaca cttcagcctg ggcgacagag caagaccctg tctcagaaaa 13260 

aaaaaaaatt caaaaatttg gccaggcgtg gtggctcacg cctgtaatcc catcactttg 13320 

gaaggccgag gcgggtggat cacctgaggt caggaattca agaccagcct ggccaccatg 13380 

atgaaaccct gtctctacta aaaatacaaa aaaaaaaaaa caaattggcc gggcatggtg 13440 

gcgggtgcct gtaatcccac ctacttggga ggctgaggca ggagaatctc tcgaactccg 13500 

gaggcagagg ttgcagcgag ccaagattgt gccactgcac tccagcctag acaacagagc 13560 

gagactctgt ctcaaaaaaa aaaaaattaa aattaaaaaa taaaaatttc atttaaaata 13620 

ctactgatct cccgtgctga cttctcgggg tttaactctc actgaggaga cgctgctttc 13680 

ataagggtaa gctcagcagg ggcaactaaa gtcatttaag cagagagctg caaagaggca 13740 

acagcctcac tgcaggcagg ggtcctcgtc acagcttcag ggctttgcag aggattacgc 13800 

aatgtacacg cacaaaactg aattccagcc tctccattgg caactgcata catacatata 13860 

ttcttttttt gagacggagt ctcgctctgt agcccaggtt ggactgcagt ggcccgatct 13920 

cggctcaatg caagctctgc ctcccgggtt caagcgattc tcttgcctca gcctcctgag 13980 

tagctgggat tacaggcgcc caccaccacg cccggctaat ttttgtattt ttagtagaga 14040 

cggggtttca ccatgttggc caggacagtc tcgatctcct gacctcgtga tccgcccgcc 14100 

tctgcctccc aaagtgctgg gattacaggc gtgagccact gagcctggcc tccaatggca 14160 

actatattaa aggttcaaag caatatgcac aaaagttacc tcacagaaaa tagtgcaagt 14220 

ccttgataca atgctcttta gacacagaag aagcactata gaatagagca cctcgcccta 14280 

ttgccttccc aagggcgagc accccctcct ctctccacag ctccttcttt gtttttttga 14340 

gatggagtct cgctctgtca cccaggctgg agtgcaatgg caaaatcttg gctcactgca 14400 

acctccgcct cccgggttga agtgattctc ctgcctcagc ctcccgagta gctgggacta 14460 

caggcaccca acacgcctag ctaatttttg catttttggt agagacgggg tttcatcatg 14520 

ttggccaggc tggtctcgaa ctcctgacct ccagtgatcc tcccaccttg acctcccata 14580 

gtgctgggat tataggtgtg agccactaca cctggcctct ccacagcccc ttctgtgttg 14640 

aagccaagac ccacccagct ttgatcccaa ggcttgggtt ccccactagt gtgaagtgag 14700 

tttccaaatt attaggtaaa tcagatatga gaaaatattt tattttactt tttttttttt 14760 

gagacgcaat cttgctccgt cacccaggct ggagtgcaat ggcaccatct ccactcactg 14820 

caacctctgc cttctgggtt caagcaattc tcctgcctca gcctcccaac tagctgggat 14880 

tacaagtgca caccaccacg cccggctaac ttttgtattt ttagtagaga cagggtttca 14940 

ccgtgttagc caggctgctc tcaaactcct gacctcatga tccgcccacg tcgggctccc 15000 

aaagtggtgg gattacaggt gtgagccatc acacctggcc caagaaaata tttttaaact 15060 

agtattcttg accggcacgg tcaacactga tgtaattgaa actgttgtat ttgaagtgtt 15120 

agcaaagaaa gagaattctg gttcaacaga aaagtcagtc acgacttttc agtcacgcat 15180 

gaattacaca gtaaccaaat agataacatg ccatgactga cgacgggccc acaacaaatc 15240 

agctccgacc aacagggtcc acaccaccat gggtctacac agatccaggt cccgcctgtg 15300 

agcctacagt gacgcgggcc cctgtggggt ggtccctgca ggtcaggtcc ctgagagtgg 15360 

gtcccagtgg ggtgatccct gcgggtcgcg tccctgcgag ttgggtgcct gccgggtggc 15420 
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ccctgcgggt cgggtgcctg cggggtggtc cctatgggtc gcgtccctgc gggtcgggtg 15480 

cctgcggggt ggcccctggg aatcgcgtcc ctgcgggtcg ggtgcctgcg gggtggcccc 15540 

tggggatcgc gtccctgcgg gtcgggtgcc tgcggggtgg cccctgggga tcgcgtccct 15600 

gcgggtcggg tgcctgcggg gtggtccttg tgggtcgcgt ccctgtgggg tggtccctgt 15660 

gggtcgcgtc cctgtggggt ggcccctgcg ggtcgcgtgg tggcccctgc gggtcgggtg 15720 

cctgcggggt ggtccctgtg ggtcgcgtcc ctgcgggtcg ggtgcctgcg gggtggtccc 15780 

tgcgggtcgc acccctgcgg cgtggtcccc ccgggatggg tccaccgagg aggccgctgg 15840 

aggccgagcc cgcgcccgcc cgcggcgcca agatggaggc aggaagcgcc gccgcccgcg 15900 

cccgccaccg cccgcgccgc ccgcctgacg ccgccgttgc gcctgacgcc gccgcccgcg 15960 

cggccgcccc tcccccggcc ctcccctccc cccgccgtaa cgtcctgacg ctccgcaggg 16020 

acccctgact ggacggcggc gcgtgagcgg agcgagaggc ctcgccgcgg gggggccgcg 16080 

ggctcgccgg cgccgcttac ctggggccgc gccgggcctg cttaggcacc cggcgggggc 16140 

ggcggcgtcg ggagctgcgg cggcggcggg cggcggcggc ggccgcgggc ttcgctcctt 16200 

gttggggatt cggcggcggc ggcggcgcgg gcgcgcgctt cctagtgacg caggcggcgg 16260 

ggccgcgcac gcacggggct gggagggccg gacacttatt tggcgctcgc ggaggaggaa 16320 

ggcggggccg tgaaataagg cccgacgggc cccggggcgc gtgcgcggac cgacactgtc 16380 

agctcctaac gccgcaggtt cctcctggtc cccgaggccc ccggtcgggc gttgcctgcc 16440 

ccgcgcgggc ggccgggccg agggacgatg gtcagtggac ggacggcgcc agggagcagt 16500 

gcccacgcgc ggcagggcgg taccttcagg cctccaggta cgggcgctcc tcgcccggac 16560 

gctgctgtgt gtgaatgggc gcgaggggac tcccctgcgg ggcggacgcc tgaacacgag 16620 

gctgtggagg aggacgctgt agggtgcgcg gactcacgcg gaacatgcca gaggctcagc 16680 

cagccacggc gctcccagcg tggagggcga ggggcatccg ggagcggccg ggagggctcg 16740 

gtcacccctc aagctgtcac cccagtccca caaccagcac cccgatccta tcgcagtccc 16800 

acagccgaca ccccgatccc acccctgccc aacagccggc acccacccca atcccatagc 16860 

taacaccccg gtcccaccgc tgtcccacgg ccggcacccc gatcccaccc cagtcccgca 16920 

gctggcaccc cgatcccacc ccagcccaac agctggcacc caccccgatc ccaccgctgt 16980 

cccacagccg gcaccccgat cccaccccag tcccgcagcc ggcaccccga tcccacagcc 17040 

ggcactcacc ccgatcgcat agcatagctg ataccccgat cccaccccag tcccatagcc 17100 

agcaccccga tcccacccca gtcccatagc cagcacctcg atcccataga tgacaccccg 17160 

atcacgcccc agtcctatag cccgcacccc gatcccaccc gagtcccgca gccggcaccc 17220 

catcccaccc atgtcccaca gtcggcaccc cgatcccact cggatccggc agccagcttg 17280 

gatcctgtgg ccctcctcca gcccccaggg ctcatttata tgttttattg gcagaggctg 17340 

gggctggctc tgttggcctc tgtgctgggt ttcttcctct gcaccgcagg actggctctc 17400 

ctgacctctc caggtgtcat cgaacaccct tgtgcttgct gtcacccgct gcctgtctgc 17460 

aggatcccgg attccgtatc aggggaccga aattagtcgg aaaataggaa gcaggtgctc 17520 

gcttggatgg aaccctgacc ctgtgctcac acttgtagga ggagggctct gcaggccgcc 17580 

tcccggaacg ggaggttccc aagccactgc acttcggagg ggctgtaatt agagttgcac 17640 

attcattcag ttcccagtaa agtagaacgt gctccagcca gtgaggaaaa ggtgttttta 17700 

aaaattagat tggccgagtg cggtggctca tgccttttac ctcaacactt tgggagacaa 17760 

aggtgggagg atcacctgtg gccaggagtt caagaccagc ctgggcaaca gagcctgtct 17820 

ctggggaaga ataaaaaaaa aaattgagcc tttgtcagtg ctactatttt attatctggt 17880 

aaatatgaga gggttcacgc ggtctatgtg tgtcatttat ctgagtttgc ctatcgtcac 17940 

gttttggaaa taaatgtcaa taaagtcgaa gaggagtgct gaggggggcc tggggatggg 18000 

agggtggcta catcatgcct gtgtgttgcg caagcccacc gaggtcggcc tggggtgagc 18060 

cctggggcct gttctgcctc cttcactctg gggctccaag agacaaactg ggcaacaaga 18120 

gagaaactcc atctaaaaaa aaagaaaaat cacctccaag ataacttagc tttcttctgc 18180 

tggcataaca aattatctca aacttagtcg cttaaaaatg caaatttagg ctgagtgcgg 18240 

aggctcacgc ccataatcct agcactttgg gaggccaagg caggattgct tgaggccagg 18300 

agttcgagac caacatggcc agaactgtct ctttttaaaa aatgcaaatg tgtccggcac 18360 

ggtggctcac gcctataatc ccagcacttt gtgaggccaa ggcgggcaga tcacgaggtc 18420 

aggagataga gaccatcctg gctaacactg tgaaaccccc tctctactaa aaatacaaaa 18480 

aattagcctg gcgtggtggc aggcgcctgt agtcccagct actcgggagg ctgaggcagg 18540 

agaatggcgt gaacccagga agcggagctt gcagtgagcc gagatggcgc cactgcactc 18600 

cagcctaggc aacagagcaa gactccgtct caaaaaataa ataaataaaa ctgcaaatgt 18660 

attctctaac tgttctgtag gtcggaagtc cagcccagcc tcactccgcc aaaatcaggg 18720 

tgtctgcagg gccgattgct tttggagctc caggggagaa gctgttctgg cctttccagt 18780 

ttctggaagc acttgagccc cttgtctcgt ggcctatccc acacctgaaa gccagccaaa 18840 

gccagttgag tcctcaccct gttggccccg acactgatct cctgcctccc tcatctgctg 18900 

tcaaggcccc ttgtgatgac atggggccac cagctggccc agggcacctc ctgtcagagt 18960 

ccgccgacca gtgaccttca ttccatctgt cgctgtaatt cccctttgct tggaaccaac 19020 

gttcacagat cccaggggtt aggatgtgaa tatcttgggc agggctgtgg gggggctatt 19080 
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cctccttcta aaatatttat catttttgtt ttggggattt ttttggtttg gttttttttg 19140 

agacagagtc tcgctctgtc gcccaggttg gagtgcaatg gtgcaatctc agctcactgc 19200 

aacctctgcc tccgggcaga cgtgagccac tgcaccaggc ctgtttttgt ttttgtttgt 19260 

tttgttttgt ttttgagatg gagtctcggc cgggcgcggt ggctcacgcc tgtaatccca 19320 

gcactttggg aggccgaggc gggcggatca cgaggtcagg agatcgagac catcctggct 19380 

aacacggtga aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tggtagcggg 19440 

cgcctgtagt cccagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 19500 

ggagcttgca gtgagccgag atcgcgccac tgcactccag cctgggcgac agagcgagac 19560 

tccgtctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaagag atggagtctc actttgtcac 19620 

ccaggctgga gtgtagtggc gggattatag gtacgcgcca tcatgcccag ttactttttg 19680 

tatttttagt agagacaggg ttttaccatg ttggtcagac tggtctcaaa ctcctgatct 19740 

caggtaatcc acccgcctca gcctcccaaa gtgctgggat tacagacgtg agccaccgtg 19800 

tctggccata tttattaact acaaagggaa agatgataat tttttttttt gagatggagt 19860 

ctcactctgt cacccaggct ggagtacaat agcgtgatct tggctcactg aaacctctgc 19920 

ctcccaggtt caagcgattc tcctgcctca gcctcccaac tagctgggat tacaggcgca 19980 

cgctaccaag cccagctaat ttttgtattt ttagtagaaa cggagtttca ccatgttggt 20040 

gaggctggtc tcgaactcct gaccttgtga tctgcccacc tcggcctccc aaagtgctgg 20100 

gattataggc atgagccact gcaaccggct gaaagatggt aattttaaag tagagaaact 20160 

gggttggctg ggcatggtgg cttatgcctg taagctcagc actttggaag tccaaggcaa 20220 

gaggatcgct tgagtccagg agtttgagac cagcctggac aatatagcaa gaccccatct 20280 

ccgcaaaagc taaaaagtta gccaggtgtg gcggcacatg cctgtagtcc cagctactca 20340 

ggaggctgac gtgggaggat cacttgagac caggaggtca aggctgaagt gagctgttat 20400 

tgtgccactg cactcagcct gggcaacaga gcgagagtct gtctccaaag gtaaaaaaag 20460 

gtccaggcac agtggctcac acctgtaatc tcagcacttt gggaggccga ggcgggcaga 20520 

ttcgttgagg tcaggagttc aaaacgagcc tggctaaatg gtgaaacccc gtctctacta 20580 

aaaatacaaa aaaattagcc aggcatggtg acgggcgcct gtaatctcag ctacttggga 20640 

gactgaggca ggagaatcat gtaaacccag gaggctgagg ttgcagcgag ccaagatcat 20700 

gccactgcac ttcagcctgg gcgacagagc aagactgtct caaaacaaaa caaaagaatc 20760 

ttgagtcctg agttcctcta agggaaattc caggcacctc gccacccttg acaggcaaag 20820 

gaacaatctg atgaggaaga agatagaaac agcttaaaca atagtctccc ggccgggggc 20880 

agtggctcac gcctgtaatc tgagcacttt gggaggccga ggcgggtgga tcacaaggtc 20940 

aagagatcaa gaccatcctg gctaacatgg tgaaaccccg tctctactaa aaatacaaaa 21000 

aattagccgg gcgtggtggt gggtgcctgt agtcccagct actcgggagg ctgaggcagg 21060 

agaatggcgt gaacccagga ggcggagctt tcagtgagct gagatcgcgc ctctgcactc 21120 

cagcctgggc gacagagcct cgagactcca tctcaaaaaa aaaaaaaaat tagctgggtg 21180 

tggtggctca cacctgtaat cccagctacg tggcaggctg aggcaggaga atcgcttgaa 21240 

cctgggaggc ggaggttgta gggagctgag atcgcaccac tgcactccag cctgggcaac 21300 

agagcgagac tctgtctcaa aaaaaaaaaa aaaaaacaaa aaaacaatag tctcccaagt 21360 

aagtcagagt cacaaggtgt tttgattccc tgtggaaact aaaatataac agcttaacat 21420 

atgttcttga gttatttttc agaaacttgg acatccacca ggtggaaaat gctgagctag 21480 

gaacagtggc tataatttca gccttttgag aggccaaggt ggaaggatca cttgaggcca 21540 

ggagttagag accagcctgg ccaacatggt gaaaccccgt ctctagtaaa aatacaaata 21600 

ttagctgggc atggtggtgc aacctgaaat cccagctact tgggagacct agctgggagg 21660 

atcgcttgaa cctggtagga ggagtttgca gtgagctgaa attgtgccac tgcactctag 21720 

cctgggcaac agagtgagac tctgtctcaa aaaataaata aataaaaaga gaaaaaagtg 21780 

ttgcctgcag gccgggcaca gtggctcacg cctgtaatcc caacactttg ggaggccgag 21840 

atgggcagat cacctgaggt caggagtgca agaacagcct ggccaacatg gtgaaacccc 21900 

atctctacta aaaatacaaa agttagctgg gtgtgtacat gtagtctcag ctacttggga 21960 

agctgaggca ggagaatctc ttcaaccggg gaggtggagg ttgcgatgag ctgagatcac 22020 

gccaccacac tccatccagc ctgggtgaca gagtgagact ccatctcaaa gcaaaaaaag 22080 

aaacataggt gggacccttg gtgtgtcctt agggcatgat ggttgaggta tactgctggt 22140 

cctgtcatgt aaaagaaaac gagccgactc tgtgtctact ggagaaagca ctgcatatat 22200 

cagccacagt caatacctcg cttctgcagg gacggtggct gccagagtgg gaggctttgg 22260 

tagcacccat gtcgtggaat cacaatgttg tcgatagctc tggggtcttg tacaaaatgc 22320 

cagatcctcc catttggttt ccttatggga aggatcgcag tactataata catgggcttg 22380 

tgcaagggat cattataccc ttttctcttt ttttgctttt ctttgagaca gagtttcact 22440 

ctcgtcaccc aggctggagt gcaatggcgc gatcttggct cactgcaacc tccacctcct 22500 

gggttcaagt gattttcctg gctcagcctt ctgagtagct gggattacac atgcccgcca 22560 

ccaggcctga cttatttttg tatttttagt agagacaggg tttcaccaag ttggtcaggc 22620 

tggtcttgaa ctcctgacct caggtgatcc acccacctcg gcctcccaaa gtgttgggat 22680 

ttcaggcata agccaccagg cccagccttt ctttcttttt aaaattaatc tttgtttaaa 22740 
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aatactctca ttttttattt aattgtagca ctcctagatc ccgaaagcag atacactctt 22800 

gttatgggtc tgattctttt cattgcttca cgccttagag gatattgtcc aatactggat 22860 

aaaagtttac tcaggtctac ttccacttta acggggatgg ctgaatatct cttccacttg 22920 

gctgtttgtt tataatgaac tgacaaacat acaaattttc ttgagttctg tgagacattc 22980 

tagtaaatca tctaacctga agagcaggtt gtgagaaccc ctgatttaga aagcccagtg 23040 

gtcataaata taagtggctc tggactggct cccggggtct gaagtgtggg cagtcggtta 23100 

ggattgagcc cttgtaattt gtaggatctg acacacactc caggaaggca gtgtcagaat 23160 

ttacctgtat tatattggac acccagttag cgtttggaga attggttgct ggtatagaaa 23220 

aataccaaat attttatgtc aggggagtga aagaaaaaac aaaaacccgg ccgggcgcgg 23280 

tggctcacgc ctgtcatccc agcactttgg gaggccgaga cgggcggatc acgaggtcag 23340 

gagatcgaga ccatcctggc taacacggtg aaaccccatc tctactaaaa atacaaaaat 23400 

tagccgggcg tggtggcgcg cgcctgtagt cccagctact cgggaggctg aggcaggaga 23460 

atggcgtgaa cccgggaggc ggagcttgca gtgagcccag atcgcgccac cgcactccag 23520 

cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa caaaaaaaaa aaacaaaaaa 23580 

aaaaaaccca tacactttaa ggaaagcaac tgacagcatt tgttaccagt gataaaattt 23640 

gagctttgaa gtaagaataa caattttgcc attgtgcccg ggccaagaaa aaaaaaagaa 23700 

ttttgccatt gtgaaaggct tcccagtact ttctgatgag cttgacggtg atattaacaa 23760 

ataacttttt tttttttttt ttgagatggg gtcttgctct gtcacccagg ctggagtgca 23820 

gtggttcaat ctcagctcac tgcaacctcc gcctcccagg ttcaagcgat tctcctgcct 23880 

caacgtccca agtcgctgga ctacaggtgt gcgccaccac gtccagataa tttttgtatt 23940 

tttagtagag atggggtttc accatgttgc ccagactggt ctcaaactcg tgacctcagg 24000 

cgacccgccc acctcggcct cccaaaggtg ggaggccttg ctgggattag aggtatgagc 24060 

cgctgcacct ggcctcttgt ccttgtgttt tgcagtgatg caatgaccat gtcttacatt 24120 

tgcaaccaga aaaaaaggtt agtgtaacaa tgtttatcct gtttttccca gagtagacat 24180 

tatgaagatt aaaaaaattt gaaagtgttt tgaatataat aaactatgct atacacacaa 24240 

cattttggtg actagaaata caagtttatt gtttgttgtt tgttgagaca gggccctgct 24300 

ctgtctccca ggctgggtgg cacaatcatg gctcactaca gtcttgaact cctgggctta 24360 

agcgatcctc ccacctcagc ctccagagta gctgggactg caaacgagca ccaccacgcc 24420 

tggctaatat ttgtattttt tgtagagatg gggtttcacc atgttgccca gactggtctc 24480 

aaactcctgg gctcaagcaa tgctcctgcc tcggcctccc aaagtgctgg gatcacaagt 24540 

atgagccact gcacccggct gagtttctgt tgttttaagc cgcttcattt gtggtacttc 24600 

ttacagcagt cccaggaaac tgagcaactg cagaacatca aaattgtttt tcttcagcaa 24660 

aaggagaagc acttgtggtt ggcaccagct tttcctgtgc tcacttctgc atggccgcac 24720 

ctttgcccga cacgagtgca cagcaggctg tgggggagca actggttgag tcaggcctcc 24780 

acttgtgccg tatccccacc tgctttgctg gacacccctg tttggggggc acccactgct 24840 

gccccagaca ccaagcaagc accagctgtg tccaaaactt acagtcactg tcttggcccg 24900 

ttttgtgctg ctgtaacaga atgccacaga ctgggtaatt taatacagaa cagaaattta 24960 

tttcctcaaa gttttggagg ctgggaagtc caagagcaag gggccatcag gtcagggcct 25020 

ggtctctgct tccacgatgg caccttgacc accgtgtcct cacgtggtca gagagagccc 25080 

actcccagga gcccttttaa tagagcagaa cactgctgcg ctgcggttaa gtttccaaca 25140 

cgtgaacttc ggaggtgaca cattcagatc atagcagtca ctctaggcag agtgtctgat 25200 

gtggttttaa aatacgttca cagactggcc gggcactgta gctcacgtct gtaatcccaa 25260 

cagtttggga ggccaaggtg ggtggatcac ctgaggtcag gagttcaaga ccagcctcac 25320 

caacatggtg aaaccccatc tctactaaaa atacaaaatt agccaggtgg tgcatgcctg 25380 

taatcccagc tactcgggag gccgaggctg gagaatcgct tgaatccagg aggtggaggt 25440 

tacagtgagt cgagatcatg ccattgcact ccagcctggg caacaagagc gaaactctgt 25500 

ctcaaaaaat aaaataaaat aaaatacatt cacaaggccg ggcactgtgg ctcacgcctg 25560 

taatcccagc tacttgggag actgaggcag gagaatcgct tataacctgg gaggtggagg 25620 

ttgcagtgag ctgagatcac accgctacac tctagcttgg gcaacaagag tgaaactccg 25680 

tctcaaaaaa gtaaaataag gccctgcagg catggtggcc cacacctgta atcccagcac 25740 

tttaggaggc caaggcggtc ggatcacgag gtcaggagtt cgagaccagc ctggccaaca 25800 

tgatgaaacc ccgtctctac tagcctagcc aacatgggga aaccctgtct ctactaaaaa 25860 

tacaaaaatt agccgggcat ggtggtgcgt gcctgtaatc ccagctactc aggaggctga 25920 

ggcaggagaa tcgcttgaac ccaggaagca gagggtgcag tgagccaaga ttgcgccgct 25980 

gctctctagc ctgggcgaca gagcgagact ccatctctaa ataaataaat aaaataagaa 26040 

aataaaatat gttcacaaat cctttgacat tcctcacctc aaaagctgga acccaactcc 26100 

ctcctaagca tgagtcttct cagtgactca cttctaacag cagaacttac atggttcccc 26160 

acacccagag gacattgggt tcctcccaat atccccccac ccagcgaccc ccacccaggt 26220 

cgctggcttt gggtccccca gagccatgtt tcaaggacac tcaggcagcc cctggatgtc 26280 

catgtggtaa ggaatgaagg cctcctgcct gcagcctcgg gagggagcat tctcagaaga 26340 

ggatgcccca cctcctgccc agccttcaga tggccaggac ctcgtccaac gtcctgactg 26400 



16 



WO 01/42434 



PCT/US00/33065 



caacatcatg agagactccg agccagaaac ccccaggttt tgtactcctg acttatggga 26460 

actgacagat aatgttcgtt gttaattaag gggtgacttg tcacacacaa taggtcacta 26520 

aacagctctg tctggcctcc caggaggagc ctgcctttcc ttttcttcat gggaaaagtg 26580 

cgatcagttt gtgaaggaat gtccgccccc acttgatgcc agaggctcca catggtgact 26640 

gtcataaact ccatctgccc tcagtgcctt gccagcaccc ggcctgcgat cagcttggtc 26700 

ttgcgggagg ccaaggccca cgtgtgtttg tgtgtggtgt ctgtgtctgc gtgcccatgc 26760 

atgcccaggg tacagggatg ccatatacaa attctttcaa tgttgtatgt ggcatgtgtg 26820 

tgtctgtatg cccaggatac agggatgcta tatacaaact ctgttttttc gttttttttt 26880 

ttttgagaca gagtcttgct gtttcgccca ggccggactg cagtggcgct atctcggctc 26940 

actgcaagct ccacctcccg ggttcacgcc atcctcctgc ctcagcctcc tgagtagctg 27000 

gaactacagg cgcccgccac cacacccggc taattttttg tatttttagt agagacgggg 27060 

tttcaccatg ttagccagga tggtcttgat ctcctgacct cgtgatccac ccgcctcagc 27120 

ctcccaaagt gctgggatta caggcatgag ccaccacgcc tggcctacaa actctttctt 27180 

tttttttttt ttttttttga gatggagtct cactgtcttc caggctggag tgcagtgatg 27240 

cgatctcagc tcactgcaag ctccacctcc cgggttcatg ccattctcct gcctcagcct 27300 

cccaagtagc tgggactaca ggcacacacc accacgccca gctaattttt tgtgttttta 27360 

gcagagatgg ggtttcacca tgttagccag gatggtctcg atctcctgac ctcgtgatcc 27420 

gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cccagcctgc 27480 

aaactctttc aatgtctttc ttttctctct cctgccatct tctcccttgc agatttcttt 27540 

tgtctctacg tcttccccag ctgagtccga ggtcctgact tgcccacgct ccctggactg 27600 

gaggagaggt gatagcaaga gctccttcaa gcccaggaat gccaccaggg ctgccccggg 27660 

agaggaggaa gctgggtctc tcggggttgt ggggaccaga cacccttcta agacatggac 27720 

tcagcacaga aagtctagac atccactaca aacacatctc cctcctaaca gggggcccct 27780 

gggcacccca agtggctgtt tggtgggaca ggcatgtcca tcagtcagaa tatctttatt 27840 

ttttattttt tattttttat ttttgagaga gtttcactgg agtgcaatgg cacgatctca 27900 

gctccctaca acctccgcct cccaggttca agcgattctc ctgcctcagc ctgccacgta 27960 

gctgggatta caggtgtgag ccaccacacc cagctaattt tttttttttt tttttgagat 28020 

ggagtctcga ggctctgtcg cccaggctgg agtgcagagg cgcgatctca gctcactgaa 28080 

agctccgcct cctgggttca cgccattctc ctgcctcagc ctcccgagta gctgggatta 28140 

caggcatgag ccaccgcgcc cggccaattt tgtattttta gtagagacag ggtttcacca 28200 

tgttggtcag gctggtcttg aactcctgac ctcaggtgat ccacctccct cggcctccca 28260 

aagtgctggg attacaggcc tgagccacca cgcccagccc agaatgtctt cttacttttt 28320 

attactctgt cccccatcct gggtccagac ctgtgaccgt gaacaaccgg ctgcccaggg 28380 

gtgaatgggg tgagtggggt gagtccacag aacagtgggg tgcagcccca ggggtctcgt 28440 

agcacctgcc cccaggtcag gaagtcccac agcctagagg ctccagcctc agatgcatac 28500 

atatgtaggc cctgcccttt cctcctgagc ggcgggccac agagtcctga acaacaggaa 28560 

gcccctgagg agggctccgc cctgagggag ggcaggggag cccccgccag ccccacccac 28620 

agcagcgggc cctgccaccc cccaccctga cacctcaccc cttggattcc agagaggaaa 28680 

gtgggcttgt gtgtagttta catgctcata tcttaaaatc accgttgtca atagaacaat 28740 

tcataataat gatgataaaa taagatttat aaccagcttc agtctggaga tacacacaga 28800 

gcagatcttc actcccagac agggagcccg cagctgcccc cgaccccaca ggtgcaggac 28860 

acacacagac agttcaacca tgtcttaaac acacaggtgt ttatttaatt gttcatttga 28920 

ttgaattttt aagttcactt tactacgtgg atgagatggg tgcatattac agtaggcttt 28980 

cgctatgagc gctgccacca tgaggaatat cccagccctc agttctgctt ccctttctga 29040 

gtcccacaaa agccagatgt ggacagcctt gggttcccat cccagctggc tgctccttct 29100 

ggggctgtct tggtggggag agggagatgg ggcagtgggt ccctgctgac ccctgagccc 29160 

tgcaggggtc aggatcctcc cgtggtccct gggtgtggct ctggaagaca ctggcagtgc 29220 

ccggccaagg cctcccgcag gatggaagtt gagggccctg gctctgggtc ctaagagaac 29280 

tcagccgccc ccttcacact ttacagcaag gggccaggca gcagctttgg gatggggctt 29340 

ccgtggagaa gtgggggatg ctgcagtggt acaaagacag cctcccccac cgccatcctc 29400 

cagctgaccg tcctccaagg ccagcactgg gcgtccaagg gaaagaagga actcagccca 29460 

gagggtgtgg gcaggagagg cctggagtca ggcctccacc cacagccccc tctgggtgcc 29520 

aagtgggaag ggtgttgggg ctggcttggg aaccttaccc gctgcccttc caacacctgg 29580 

atctgtgggc agcggtccca caaaatcccc cttggggctc cctgaggagg acttgtggct 29640 

gccgcttcca ccagggcaga gggcacagga ggggccagca ctccaaaggg ctctagggtg 29700 

ggtctttcaa ggacatctgc aaagccctgg tggggagggg cctgggccag aggctctttg 29760 

gaactcttgc acttctgagt gggggactgt ccatgctgcc cacaacctct agaccatgca 29820 

gcctgctcat gggtccctgg cagagaatgc ccactcccca gcagactcag ggcaggcccc 29880 

caactgcagg cttccaggaa ggcccagggt gtccacctca cgccaggtgg tctcagagga 29940 

cccctgtgca accacattaa ggaaagctgc agcccccacc cacccgcctg ccagttcaac 30000 

aagcaccggc tgcacacgca ggctcccagg caccatcacc cccctccccc gtcgcccctc 30060 
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cctcacgggg agccccttcc ccctggaaag acagcaggta ctgtagcctc gcctgctggc 30120 

caggggcgcc ggctcagagg acctgccctg acctgcacgt gctgaccaga cagcccagcg 30180 

taaggacccg cgatcccacg ccaccgccct gggtttacca cggtcaccac cacctctctc 30240 

acagggcccc cgggggaccc agccgcgccc ggcctggtgt ctgcaccgag ggaccgcgtc 30300 

tcacgcccgg cggctcctgc aggggaagcc gtggtcagcg actcaccacg aggacagggc 30360 

agggcggctg agtgcggaag agaagcatga agctgggggc gggggtgggg gaggaggaac 30420 

aaaagttgca tctagacaga ggtgaacgaa acaaaaccaa aacccgaacg tgttccgtcg 30480 

caggatgggc gccgcccgtc ccgggccctt agcccgacat ctcttctcgc tgctccttgt 30540 

tcctgcgcac ctcggccgcg tgcagctcct gcaggacagg gggcgggagg gcctgagggc 30600 

gggggtggct tggggcgact ccgggaaccc ccaggcgcgc aggccgtggc gccctggcac 30660 

ccgcccggcc tcatccgggc tggccttcgg caggaccctg actgagttga gggggcggga 30720 

gcaccgggga ggcgcagagc aaggccaggg accaaggacg ggtttcctgg gagctggctg 30780 

ggccccgctt ctagctcgta ccggagccga gcttccttca gggcactttc aatataatga 30840 

atttagccat ctattactgc ggctagttac tgtcccgcca ggaccagact ctggacctgc 30900 

ctcgtgcgct gctggggacg cccagtaaac acgggaggag cccccgaccc ccaccccagc 30960 

tcagcgcctc ggagtccccg gccccgctct gcgcccctcc gagctccgcc ctagccccgc 31020 

ccccgcccag tgccccgccc cctgcctgct gctagccctg cccccgcccc ggcccctgcc 31080 

cgctccgagc tccgccctgg ccccgccccg gcccctgccc gctccgagct ccgccctggc 31140 

cccgcccccc gcccagtgcc ccgccccctg cctgctgcta gccctgcccc cgccccggcc 31200 

cctgcccgct ccgagctccg ccccggcccc gccccggccc ctgcccgctc cgagctccgc 31260 

cctggccccg cccccgccca gtgccccgcc ccctgactgc tgctagccct gcccccgccc 31320 

cggcccctgc ccgctccgag ctccgccccg gccccgcccc ggcccctgcc cgctccgagc 31380 

tccgccccgg ccccgccccg gcccctgccc gctccgagct tcgccccggc cccgccccgg 31440 

cccctgcccg ctccgagctc cgccccggcc ccgcccccgc accttctcgc gcagccgctc 31500 

gcgcagtgcg gccaggtgtg cctcgcggat ctccttgctg agctccatct tgtagttgag 31560 

cttctcctcc gcctggcggc tgaagttgtt attctcctcc agcgccttgt gcagcacctc 31620 

gcgctcgtgc tcgcgccgct ccgccagctg cttcagcacc tgcgcctcct gcgtctgtgc 31680 

ggggccggcg ggcgcgcgtg agcggcaacc ccgggccctg cccggccgga ctcctccctg 31740 

ctctccgcct cccgcccagc gcccgctcgc ctcacctggc gcctccacct gcccaggcct 31800 

cggtgggcgc cgggaccccc gggcgctgcc ctgggaaccc tcgcctgcca tccggcctgt 31860 

ggtcggggca gggccagggg gtcgcgatcc gccgcccccg cccccgtccc tgcctcgcgc 31920 

gcgggtcccg cggtcctggc tgcgcccagg gcccccgcca taccctgccg ccactgcaca 31980 

ccctgccctg cgcgtctgcc cctccaagga ccagcagcaa gaaaccctaa acttgtgggc 32040 

ggtctctgag ctttgtctct tcctcggaca tccgcccact gagcagagta gctgcttgtt 32100 

acacaccggg ttcccagctc ccaattaggt gcccaggagc ggagggtccc cagggatgct 32160 

gggggagggg ccggctggtg acccctggga ggagagcggg gcagcaggac ccgcacccac 32220 

atgccagtcc ctactagtca gccctgtgaa ccctggtctc tggcctcacc gggaagggaa 32280 

cggagccgct tcccctgccc aatgcgttgg cctccagggt ggcaccccca aaaggacatt 32340 

tttatctctg tttcagtctc agaggggctg gtgggagggg aggctgcagg gaggggacct 32400 

ggagcccaca cccacctctc ccagggcccc tccgccctcc agcaagcctc agggtcttca 32460 

cacatgaggc ccttcctcca gcttccctgt ctgggagagg gatgccccac ccgacgtccc 32520 

cagggcccat ctggggacca ccccctagca tcctgctggc cctgacaagg gtgcctccca 32580 

ccctcaccag aggctcctgc tccttccagg tggccgcctc ggaacccttc ctcctctcca 32 640 

tccctttctt tttttgttct tgtttgtttt ttgaaatgga gtctcaccct . gtcgcccggg 32700 

ctgaggagtg cagtggcgca gtctcggctc actgcatcct ccacttcttg ggttcaagca 32760 

attcccctgc ctcagactcc ctagtaggtg ggattacagg tgtgcaccac cacacctggc 32820 

taattttgta tttttagtac agatggggtt tcaccatgtt ggccaggctg atcttgaact 32880 

tccaacctca agtgatctgc ctgcctcagc ttcccaaagt tctgggatta caggcgtgag 32940 

ccaccacacc cggcctctcc ccatcccatt cttatctctc agaaagaggc ccagggagcc 33000 

acagcccctc ctgctccagg ccaaggcact gaccaagcct gtccgggagc accctgcttc 33060 

ttgcaggccc tgtccccgtg ggccgcctcc gttgaaactc ctggggggtg ggggatggag 33120 

gactccttgc cttcctccgc tcctcggctg cctccagccg cttttgcagc tcctccaggg 33180 

aggtgtcctt cttcttgggt ggggaggaga gcatagggct ctctggggac aggtcagaag 33240 

gggacttgag gatgacctcg aagctctggc ctgaggcccg cttgtccagc tgcttcacct 33300 

ccatgtctgc agggcaagac cagagtagag cttcagaggc ccggccaggg catggcgtgg 33360 

gctgagcggg atgctcccag cacacatcca accccagggc tgggcgagag ggggtggctg 33420 

ctcccgcagg aatcccaggc ttcagccccc aggatgggcc ccttccccct agaacctccc 33480 

tctccagagg cagccaggac gggagttcag agagactgcc ggaggccggg ggaaaaggtg 33540 

aggtgggcag gcaccgcagg gaagggcagg cggcagccag gcactcaccc ccgtactggt 33600 

agacggtatt ggggtgcggc tgtgtgtaga agcaggagca gatgagcgac agcaccgaca 33660 

gctccttcat cttctccttg taggctgtgg gcacaaggct gggctgagca agcaccactg 33720 
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gggcctgccc acctgggccc ccgttttccc tccccatggc tgcctctatc atgtctctgt 33780 

gagacacgga gctgcccagc acgctctctt gtgtgtctcc acaccgccgg ccccttcgtc 33840 

tctccagctc tctcgcttcc agacgtcggc actgtctccg tggtgtgtcc cctgccctct 33900 

gtctctctcg ccctctgcct ctccccgctt ttcctctctc tcggcattaa tgtctgtctc 33960 

atcttccaca ctgacttgtt tctccatcct tctcctgcct gctgtggtct gaatgtttcc 34020 

attacccaaa actcatgtgt tgaaatcgta accccaaggt gccggtgtgc ggaggtgagg 34080 

cattcggagg gaattaggcc atgaggatag agccctccta agtggcccca gagtggggct 34140 

tcagagaact ccctcacctt ccatcatgtg aggacacagc cagaagacgc cacccgtcta 34200 

tgtaccagga ggcgagacct ctccaggcac cgactctgcc ggcaccttga tcctggactt 34260 

tctggcctcc agagcgatgg gaaataagtt cctgtcgtct ataaaccact cagtctcagg 34320 

tacctgccca gactgacaaa gtggctaccc ctgcctgtct gggtctctgt ttaccttctg 34380 

tgtgtctgac tctgtcactg tcattgtatc tttctgtgtc tctgggggta gcccctgact 34440 

ctgtctttct ccctgagtgc atctttctgt gattccttgt cactgtgtgt ctttctgact 34500 

cttacctccc tctgtcccgc tacttctctc tcccctcctc ctccttccca ctcctcgcca 34560 

gctcaagcag gcaagattta ctcatgacgg gaccagcaca gatgcaaacc ctctgtgggc 34620 

aggactttct tgggctgtaa acctggatga agccctcaga ccctcctttt tccttcccaa 34680 

tgattgtgtg gtcaccttga gatgaaacca ggccctctcc aggcacatgc tctctgtcta 34740 

tctagggctg ggcttgggcc actgatgcca ccaaggagca agggagggaa gctgtccgtt 34800 

cagcaccaca gccagccctc ttgcccattc aggtcaatca agtgcccacc agccagtgtc 34860 

cctgctgccc aacccaaacc agaagcaagc cgggctcctg tggccctgtg ccctgtcagg 34920 

ggaagaggaa ggcgcctgct gtcacagtga aaataattta gctcttttgg tctattcagg 34980 

gcgaacctca ttcctaagca gacacgctgg cccggtttct cactagtgct cgataatcct 35040 

tttggctggg tgcagtggct catttaactg taatcccagc actttgggag gccaaggcag 35100 

gtggaacacc tgaggtcagg agtttgagac cagcctgacc aacatggtga aacccgatct 35160 

ctactaaaaa tataaaaatt agccaggcgt ggtggcaggc acctgtaatc ctagctactt 35220 

gggaggctga ggcaggagaa tcgcttgaac ctgggaggcg gaggttgcag tgagccgagg 35280 

tcgcgccatc gcactccagc ctgggtgaca gtgtgagact ccgtctcaaa acagaaagaa 35340 

aaagagagag aggaagaaag gaaggaggga gggagggagg aaaagaagaa aggaaaggaa 35400 

aggaagacag acaaggcaga agtaatcaag cctttcatgg tgagctgggt cttctggtga 35460 

cagtgcagag aatggtctgt cctgacttaa atttcctggt gacctacact tttctggaca 35520 

gagcagcaca gagcccaaga gggtgtaagg aggagcagaa aggaatccca gggtgggcag 35580 

gcccgtgcga gagcctttgg gggaaggaat gagactttga gccgggaagc gaggcaaagc 35640 

tacctgtctt ggtcattgtc ttcagggagg gagatggagg gggaccaggt gggggagcct 35700 

cacaggggac tttggtctga cttgtcaagt tttctttttt tctttttgag atggagtctt 35760 

gcactgttgc ccaggctgca gtgcagtggt gcgatctcgg ctcaccgcaa gctccgcctc 35820 

ctgggttcac accattctcc tgcctcagcc tcccgagtag ctgggaccac aggcaccgcc 35880 

accacaccca gctaattttt tgtattttta gtagagacgg ggtttcacta tattagccag 35940 

gatagtctcg atctcctgac ctcgtgatcc gcccgcctcg acctcccaaa gtgctgggat 36000 

tacaggtgtg agccactgtg cctggcctac tttatttttt agaaacagga ctgtgctctg 36060 

ttgcccatgc tggagtgtag ggtgcagctg tgcggttcac tgcagccttg aacttctggg 36120 

cttgacggat cctgccatct tagcagctgg gactacaggt gcatgccagc acaccagttt 36180 

tctttttttt tttatctctg ctcactgcaa ttccgcctcc tgggttctag cgattctcct 36240 

gcctcagcct cccaagtagc agggattaca cgcacatgcc accacacccg gctaattttt 36300 

gtatttttag tagagacagg gtttcactat gttggtcagg ctggtcttga gccaccgcgc 36360 

ccgcccggcc tacacaccag cttaaaaaaa agaaaaaaat agctgggcgt ggtggctcat 36420 

gcctgtaatc ccagcacttt gggaggctga ggcaggcaga tcacctgagg tcaggagttc 36480 

aagaccaacc tggccaacat ggcgaaaccc tgtctctact acaaatataa aaatcagcca 36540 

ggcgtggtgg cgggctcctc taattccagc tacttgggag gctgaggcag gagaatcact 36600 

tgaacccggg aggtggaggt tgaagtgagc caagatcgag ctactgcact ccagcctggg 36660 

agcaagactc ccgtctcaaa aaaaaaaaaa aaatttgtag tggtatggag gccgggcatg 3 6720 

gtggctcacg cctgtaatcc cagaactttg aggggccaag gcgggcagat catgaggtca 36780 

ggagttcgag accagcctga ccaacatgat gaaaccctgt ctctactaaa aataacaaaa 36840 

attagccagg catggtggcg ggcacgtgta gtcccagcta ctcgggagac tgagacggga 36900 

gaatcgcttg aacccaggag gcagaggttg cagtgagctg agatcacgcc actgcactcc 36960 

agcctgggtg acagagtgag actctgtctc aaaaacaaac acaaacaaac atatatatat 37020 

atacatgtat atatataata tatatatacg tatatataca cgtgtatata tataatatat 37080 

atacgtatat atacacgtgt atatataata tatatacgta tatatgtata tattaatata 37140 

tatacgtata tatacacgtg tatatattaa tatatatacg tatatataca cgtgtgtata 37200 

tattaatata tatacgtata tatgtgtgtg tgtgtatata tatatgtata tatatatata 37260 

tatatacata tatatataca gagagagaga gagtagtgat aggtcttgct gtcttgtcca 37320 

ggctgatctt gaactcccgg cctcaagaga ccctcccacc tcagcctccc aaagcactag 37380 
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gattataggt gtaagccaca gtacctagcc 
tgatgaggga gttagagggt gtgccagcca 
tggggacatt tagaggagct gaagaggtgg 
gagagagcaa gagggagttt gggctggggc 
aggtagggac ttggcccctc caggcctgac 
ctccaggccc ctacccttca gctcatcctt 
gccgggcgca gtggctcacg cctgtaatcc 
cgcttgagaa caagagtttg agaccagcct 
aaaatataaa aattagctgg gtgtggtggc 
ctgaggcagg agaatcactt gagcctggaa 
cactgcgttc cagcctgggc aacacagcga 
gtgtttacca taaaggccag aaaatgtgat 
aagcctgaaa ttttcaccga aaaaaagatc 
cagggaggct ctcttccagt ttgcttcacc 
gggcaggtgt gagcacctgc atcatcagga 
acctctcaca ggtcttggct ctgcccagga 
gggttctgca gctgcagaca caggtgcggc 
gcttggcttc tcccagctgg tggcccatga 
aggagcaagt gggaccaggt acccacagga 
tctggctaca gggaggaagg gcagattgga 
aactctggag agagcaagca aggggaagtc 
ggcagggcat tggaccagta ttttcagagc 
tcttattaca gagagaggat gccgaggccc 
cagctggtgc cagggaagcc cctcccagag 
tttttgtttt tttggagacg gagtctcact 
atctcggctc acggcaagct ccgcctcccg 
cgaatagctg ggactacagg cgcccgccac 
agagacgggg tttcactgtg ttagccagga 
ccgtctcagc ctcccaaagt gctgggatta 
ttttttttta agacggggtc tcactctgtc 
ggctcactgc aacctctgcc tccccagttc 
agctagaatt acaggcacat gccaccatgc 
tgaggtttca ccatgttggc caggctggtc 
cctcagcctc ccaaagtgct gggatgacag 
tgctgggatt acaggcgtga gcccccgcgc 
ttctcaccgc cccctcccca ccatccttcc 
ttctcgcagt gagctgggct tgttttgtct 
accgaggaaa tgcggattgg cctttggatg 
aggttggcct caagcagcag ggtatagtag 
acccctcccg ccgtccaggt ggatggtgcc 
ctcccgccgt ccaggtggat ggtgcccact 
cgtcgcccag gtggatggtg cccactccca 
cacccgggtg gatggtgccc gctcccaggg 
gcccgctccc agggtcacac ctcacgccca 
cgctcccagg gtcacacctc acgcccaccc 
cccagggtca cacctcacgc ccacccctcc 
gggtcacacc tcacacccac ccctcccgcc 
ttctccttct ctttcgtctt cttcgtcttc 
tagaaagagt ttctactctt gctgcccagg 
tgcaacctcc ctctccccgg gtcaagcaat 
atcacaggag tgtgtcacca cacctggcta 
tcaccatgtt ggccaggcta gtctcgaact 
ctctcaaagt gctgggatta caggcatgag 
ggaatgaagg ctcagagagg gcaagcattc 
agcccagtca tgagcctggg cctgactgca 
gatctgtcca gaactgagag gccaggggac 
ggaggctgct gccggctccc cccgggggca 
tcatggtgcc aggagcacag catggcagga 
ggaaactcag aaatgtggct accacagtca 
cttagcaggc aagcggccaa gcacctttcc 
ggagaggtgc cagcaccgtc atctctaccc 



tattaaaaat taatgttaaa caagaggatg 37440 

tgtgttccac agcagcaggt caggagacat 37500 

ccaaccctgt gctcaggagg acgggggagg 37560 

agaacgtacc tgggtcctga gaggataaga 37620 

tctgccagca accagctccc tatcagcaga 37680 

ccttatcaca catccaaaac tctgaatgtg 37740 

cagaactttg ggaggctgag gcaggaggat 37800 

aggcaacatg gtgaaacccc atctctacta 37860 

acatgcctgt tgccccagct actcaggagg 37920 

ggcggaagtt gtagtgagca gagattgtgc 37980 

gactctgtct caaaaaacaa aaactggaat 38040 

taacagctgc tcaaagcccc tgtctgccct 38100 

tgtaggctca tacagaggaa ggacaaacac 38160 

tcagcaagca gacggctggc agcaatttgg 38220 

aagaaggggc acggtgggga cgcaggtcag 38280 

gacacgtgtc caactgagag gtgaggaact 38340 

tcagcatctg atggccacgg agaccccctg 38400 

ggagcttcta tcccaagaga ctgtccctca 38460 

cggagcctgg gagtgaggcc tgccctgtgg 38520 

gggggcagga cagcaggtca ggaattggcc 38580 

tgcgcacagg gcagggctgg tcaggggcga 38640 

tggtgaggct taaagagcat gtctactgcc 38700 

agacccatcc aggccacctc tccacagaca 38760 

cctcaaggca ttgctccctc tctctctctc 38820 

ctgtctccca ggctggagtg cagtggtaca 38880 

gattcacgcc attctcctgc ctcagcctcc 38940 

cacgcccagc taattttttg tatttttagt 39000 

tggtctcgat ctcctgacct tgtgatccgc 39060 

caggtgtgag ccaccgcgcc tggacttttt 39120 

acccaggctg gagtgcagtg gcgcgatgtc 39180 

aagtgattct cctgcctcag cctcccaagt 39240 

ccagctaatt ttctgtattt ttagtagaga 39300 

ttgaactcct gacctccggt gatctgccca 39360 

gcgtgagccc ccgcgcctgg ccccccgcag 39420 

ccggcccctc cctctctttg actcccttct 39480 

ccttcactga cttcagggag ttaaaaacaa 39540 

ccctgcttct ctttgtacta aacattagat 39600 

attcatgagc aggagtcaga aaaaggcacc 39660 

tgcccgctcc cagggtcaca cctcacgccc 39720 

cactcccagg gtcacacctc acgcccaccc 39780 

cccagggtca cacctcacgc ccacccctcc 39840 

gggtcacacc tcacgcccgc ccctcccacc 39900 

tcacacctga cgcccacccg ggtggatggt 39960 

cccctcccgc ccgcccgggt ggatggtgcc 40020 

ctcccgccgt ccaggtggat ggtgcccact 40080 

cgccgcccag gtggatggtg cccactccca 40140 

cacccgggtg gatgccctta tcagctctcc 40200 

ctcctcttct ttcttctttt tttttttttt 40260 

ctggagtgca atggcacaat ctcagctcac 40320 

tatcctgcct cagtctccca gattgctggg 40380 

attttgtact tttagcagag aggggggatt 40440 

cttgacctca gtttatccac cggcctcagc 40500 

ccaccctatc tgcctcactt ctacagagga 40560 

cacccagcat cacacagagt gccgggtgag 40620 

ggctcctgtt gggagctcgc ggaggtgggg 40680 

cacagtggcc tctgacccct ggagggccct 40740 

gatggaggtc actgtcaccc aggctgcttc 40800 

gccaccagcc gatttgcctt tccctgggca 40860 

ggctgcttga cgtgcggtga gcactcatct 40920 

tgaaatattg aggcctcaga acaagcccca 40980 

agataaggag acccaggtcc tgagaggtta 41040 
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ggcagctcgg acaacaccac acagctggag gaggtcagac tctgggttgc agaaggagaa 41100 

tgtgagcaga ggccacaaaa gagcgaggag ccagtgccca gatgccgaga tgccctcgcc 41160 

ctcccagctc agccccagga accgagccca tggggaggga ccgtcaggga aaggctgtca 41220 

ggaagggcag gaggcggccc tggagaggac ggcgctgccc tcaggggcag gaggggagtc 41280 

ccctccgctg agagcccccc cacccccagt atccccgggg gtgtccagga ggaggcggag 41340 

ggaggaagcg cagatggaca ggactcccag atagggtggg gaggtgtggc cggtgacaca 41400 

cacggtcccc tcctggcagg tgctgaagtc acctggagcc tccaagcccg tggggcctga 41460 

ggggcggggt caggtcgggc acgcgtgggt gggcggagtt ctgcgccccg ggccaaggcg 41520 

cccgagttga accagtcagc tcgggagagg gaccgcggcg acctgtcccg ggggcgtaag 41580 

aaaaggtggg agggagtgcg gctcgtgaac gggggcggcg atgggaagga ggtgcggccc 41640 

ttcgtcctgt cctcccaaac gtcgagtgaa aaacgaagcg ggttctgcgg cctcgcggcg 41700 

gagcagagcg tttcgggaag ggcgggccca gcgtcctcgc gcccgaggtc gcccggcagc 41760 

tcccctgcgt ccagaatccg ccccccgccc gggcctgcgc ccgcccctcc gcctgagctc 41820 

cgcgcgggac gggccgggag gccggggtgg gcgctacctt cgaaggcggt gggtccgccc 41880 

cgcgggaggt ggaggggcgg gaggggcgga gccctctggt ctccggaggg tttggggatc 41940 

gcagtcgccc ctcccccatc cagaccccgc ggcgcaaagg gcagtggctt ttctggccag 42000 

agcaggtggc gcgggcgtcg caaagggtgg tccccgaggc cgcagcggtg tggggggagg 42060 

gcgcggtccc cctcactccg ggctccgccg tgtctggccc gcccccctcc ttcagcgccc 42120 

cctccagccc ctgtgctgca ctggcgcggg gagcgccggg ttcccggctg gggctttggc 42180 

agagggtccc accctctccc cgcctcccca cgaaggctct ggcggaccca gatctcgggt 42240 

cgccggacgc cccagggacc ccgcccgcac atcgcgagcg cgcccacccg gtcgcgagcc 42300 

cacgcccggg tctgggagcc accctgcggc agtcgcgccc tgcgtggcac gctgctcccc 42360 

caggggcgag gcgcccccgc ccgacgtccc ggtcccgagc gctccccggc gcggcgcctc 42420 

gcagcccagc gccccaccag ccccgccggc gccgcagacc ccagcctcgg gcgggtcggg 42480 

cccaggcttg caacgcgcag ggtaggagaa gggaaattgg cgtccgctgc cggccgctgc 42540 

cccaggcgag gccagacgag gcctctgctc agatcccgcc gccccacaaa gcccgtggcc 42600 

ccggagccta ccggaaatgg tgctggccat ggtgctggcg gcggttgggc ctgcggaggc 42660 

tggagaggcg caagtggcgg ccggagctgc agacggctgg tgctgcagtg ccggggaggg 42720 

gaggggagag gagtggaggg agcgagggcg ggcgggaggc gggcgcggcg ggagagagag 42780 

agggagggag acagagggag agagagagag ggttggggga aggagcgggg ggaggaggga 42840 

gggagggttg ggggaaggag agagagagag agagagactg cgggggcggg ggaaggaggg 42900 

agggaggaag ggagggagga agagagagag gagcaagcgc ctggctgcgg aaggggccgc 42960 

ggctctcagg gggagagggc ggaggagggg ggctacccga actgcaacaa gaccccccac 43020 

cctccaaccg ctcacagcgg gacagctgct tctccaactt ggctttgtga ggcctgagag 43080 

tggggtgggg gtggagatga gcccccattc cccagggcag gcggggcagg ggcaatgccg 43140 

gaggagcagg tcccacccat ggggtggggc cgcagagctc ttcgccgcca aggccgctgt 43200 

aggctgggct ggcgccaaca gggtccaggt ctgtgcctgc catcggagag gatgccacag 43260 

ccacaggggt gggcgctggc ctggaggcct ccaaggggca tctcctgtga gcccagggga 43320 

tgggcaggat ctgagcggag aagagtgaaa gtggaggagt gaggccagaa caaaggcttt 43380 

gccgtgaaag aggtggtttc ccgcctgggc tcagaccttc actcactgtg tggcccaggc 43440 

caagggcaag cgtctgacct cgctgggcct ttgtttctca ggggtaagat gaaacaatga 43500 

tgcccccaga cgatggagag gaggggtgcc agggttgtgc gcacttagtg agtggggggc 43560 

aacctatcct gcctccccct ctcctcataa ctcccaaagg gaaagcctgg taggcaaacg 43620 

gagcgtcttt gccattgcag ggatgaagcc accgaggcag ggagaaaagt gctttgccct 43680 

acaagcaact aagtcatagg gccaggagca aaaccctgaa aacctcagga gacttgcaga 43740 

gccatgaggc tggctcagca acacaaaagc caggggcaag cctcagctct agcagtgcgg 43800 

tgggagcacc caaggccagt cacatcctag ggtggcctgg agagtcctga cccctgacgt 43860 

gcaagccggc atcatccccg ggactgtgag tctggtgggg gtgatgccca ggaatgtgac 43920 

attgtgtggc ccagaggtac ccttaagact ggaggatcac caggcgggcc ctgacctcat 43980 

cacaggagcc ctttaaaagc agtttccttt gcctggttga agaaatcgga gggatcaaac 44040 

caaagaaggt tttctgttgt tgagatgagg gggccacgtg gcaaggatct gagaactgct 44100 

cccagccaac agccagcaag acaacaagac cttaactgca aggaagtgag ttctgccaac 44160 

aagaagagaa tgggcttgga ggcaggtttg accccagggc ctccacacaa gaactgagcc 44220 

caactgccca cttggtttca gccttgggtt actaagaatt aggaggtaat gaatgagagt 44280 

tgttttaagc tgttggtttt gtggtgattt gctatgaagc catatcaaac taatatacac 44340 

acagaggtgt tggcccctgg gccattccta ggaagccagc tctgcgaagg aggaagaagg 44400 

gcagagaggc acacagagct gcccaccaca gcagctgtgt cctccctgtt ggccaccaca 44460 

gtagcagttg gggatggtca gcatccttca ggcagactcc agccccgggt gctggagctc 44520 

aggtgctagg gatcaagaga agtagccctc tctgggacct ccagagtctt ctcatgtggg 44580 

tggggtagga cccacccagt caggctcaga gcaccgcaat gcctcacact cattgtgact 44640 

ctggccaggc cctctctgag cctctgtgtc ctcatctgga gcacagggac caggtgtgtg 44700 
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gaagcccgtg gcatagtgcc aggaacacag tagatgtgca cagtgtgcac .tagcaggaac 44760 

acacaacagg ggtactgact gtcagcacct aggcaggcac acgcaatggg gtactgactg 44820 

tcagccatac tgactgtcag cgtgctagca ggcatacaca acagctgtac tgacagcaca 44880 

ctagcaggca catgccatag gtgtactgac tctcagtgca ctggcaggca cacgcaatag 44940 

gagtaatgac agcatgctgg caggcacaca atagctgtac tgactgtttg ccccaatata 45000 

gtgccaggtc ttggagcaga ttttgacttc tcaccaagat caaatgcaga aagtgcacga 45060 

gcatttcaaa gatgtttttc acatgcacat tagtgctagt taaaaaaatg ttttgactgg 45120 

gtgcagtggc tcacaactgt aatcccaaca ctttgggggg ccgaggtggg cagatcacct 45180 

gaggtcagga gtttgagacc agcctggcca acatggtgaa accccatcta ccctaaaaat 45240 

acaaaaatta gccaggtgtg gtggcaggtg cctgtaatct cagctacttt ggaggctgaa 45300 

gcaggagaat cacttgaatc caggaggcag aggttgcagt gagccgagat cccaccactg 45360 

cactccagcc tgggcaacaa tatcaagact ccacctcaaa aaaaaaaatg tttttcataa 45420 

agtgtgactt ttatcagacc tctgcattct tgaaattaac tctggcttgg ctgggcgtgg 45480 

tggcccacac ctgtaatctt aacactttgg gaggctgagg tgggcagatc acgaggtcag 45540 

gagttcaaga ccagcctgac caacatgatg aaaccccatc tctactaaaa atacaaaaat 45600 

tagccgggcg tggtggcatg cacctgtaat cccagctact caggaggctg aggcaggaga 45660 

atcgcttgaa cccaggaggt ggaggttgca gggagccgag atcgcaccac tctattccag 45720 

cctgggcgac agagcaagac tctgtctcaa aaaaaaaaaa gaaagaaaga aattaactct 45780 

ggctcctaga aggagcccta tatctcagca ggacactcag tcattcaaca gacatctgtc 45840 

aagcacctgc tgtatgctgg agctgtgggt acgtcagcaa ttagaggaag agggcagggg 45900 

tacaggagtt cctgaccacc ccaggccagc acgctcctat agcagctggc aaggagcaga 45960 

tgactcagac ttcagctcag tccacaggac agccttttct ggccactgct ctcaggagat 46020 

gagatgtgtg gctgcaaaag gtaaactcct ggctcctgag caggctctgg gcaatctgct 46080 

caacgctctg tgcctcactt tctcacccag aaagtgtgga caatgagagg acttatctgg 46140 

ctgggcgcgg tggctcacgc ctgtaatccc agcactttgg gaggccgagg cgggtggatc 46200 

acctgaggtc aggagttcaa gacctgcctg gccaacacgg tcaaactcca tctctactaa 46260 

aaatataaaa aattagccgg gcttagtggt gcacacctgt aatcccagct acttgagagg 46320 

ctgaggcagg agaatcactt gaacccagga ggtggaggtt gcagtgagcc aagattgtgc 46380 

cactgcactc cagcctgggc aaaaagccaa aactctgtct caaagaaaaa agaatcatgg 46440 

cagaaggtga agtctatgtt agtcccagtt cccaggtcgt acatggcggc aggagaaaga 46500 

gagagagaag gggaaactgc cacttttaaa ccatcgggtc tcctgagcac tcactgtcag 46560 

aacagcctgg aggaaactga ccgcatgatc caaccacctc cctccaggtc cctccctcca 46620 

cacgtgggga ttacaattcg aggtgagact tgggtggaga cacagagccg aaccatatca 46680 

gcatgtatgg ggggcactga aacttgtgct tggtgcccat tcattcaacg agtgtgtgtg 46740 

gctggtctcc tcatcttcaa ctccctgccg agtctcagat aggcagcctg cagttccttc 46800 

accacaacag gcacatgggg ctgggtgcca gtgagtgctg gggcttctcc gagcactatc 46860 

tcacacccag gagcgtgggc acgcatggca ttcgcatgtg ccgtcagtgg acattaaaca 46920 

cagccatgaa gaagccacga agaagtgctg cctgccggcc gtgcgcggtc acgcagcgcc 46980 

aactccctcc tggggccttc tggggccttc tggggcatgg gagctggggc cgcctgagac 47040 

aaacatccgt gacgctgggc tgaccccaca gaacggtgcg ggcctcgctc ttggagtcag 47100 

ccctgctgcc agccagtgcc gggtgctggg gactcaggga ggcccgccgg gaccactgcg 47160 

ggacagtgag ccgagcagaa gctggaacgc aggagaggaa ggagaggggg cggtcagggc 47220 

tctcaggagc cgggtcctgg gcaaggcgca gccgttttca aattttcagg aaagcggtcg 47280 

gctcacactc gagcagtaaa aagatgcctc tggggaggag gcccgtgcag ctctccgggc 47340 

aatggtggtg gctcggccta gagaggcggt agtggaacgc agaccctggt gggggaatga 47400 

catcaaggga ggagacgggc gggaccccag atttctgcct gtgggcgatg gaagtgaggt 47460 

tcactggcca gcggagccgg acacagaacg cgcaaaacgc cgtgtaggcc tggaggagcc 47520 

gaagagcagg cggaccccct ccgcggggga acagtttccg ccgggagcac aaagcaacgg 47580 

accggaagtg gggggcggaa gtgcagtggg ctcagcgccg actgcgcgcc tctgcccgcg 47640 

aaaactctga gctggctgac agctggggac gggtggcggc cctcgactgg agtcggttga 47700 

gttcctgagg gaccccggtt ctggaaggtt cgccgcggag acaagtgagc agtgagtcgc 47760 

agtgacccta caagtggttc ttttacccga gcggctcgta ggcgcgttgc ggtttttcga 47820 

aactacagct cccggcaggc cccaagccgc cctcggggcc gcgggtcggc ggattggccg 47880 

cgctgcattt tgggacctgt agtttcctgc gctcgtggcg ctggcgccgc ggcgttggct 47940 

gagcccttga ccggggctgg agggaagggc cgacattcag tgtgtccgcg tctgttctgt 48000 

tagtcccagt tcccgggcgg gattgaggct tagagaagtt gagtgatttg ctgagggctg 48060 

cacgggttgg catcccggca tgctctttcg ctactttggc tgcatctggt tgcccacccg 48120 

ggcggatggg gaatggactc cagccagcca ggagggcaga gggctggaga ggcagggccg 48180 

gaggttcaga ccctccgctc tgacgttgcg cctggtgagg ccgggagggg tgccgcttgc 48240 

ctcttcagcc ctcacgctct tgtggaagtc gcggaattac tgcaggcgga acttgcagca 48300 

ctgtgggcgt cttttccaga gaaggacgga gttgtggggc gggaggataa ggcaaggccc 48360 
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agccacttcg catcttcgcc ccgccagctc ctcgagatgg gatataccag ggttgctctc 48420 

caaccctctc cgcaggaggg actgatggaa acgcctggga aagtagcccg gtacccacaa 48480 

aggctgtcta caaacagagt cttactgtct ttcccaggtc tgtgccatag ggattctcga 48540 

agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc ttaccaggag tgcccgagac 48600 

cctaagatgt tcggagtggt tttttcgcac agacccgaat agcctgcccc tcagccacgc 48660 

tctgtgccct tctgagaaca ggctgatatg cccaagatag tcctgaatgg tgtgaccgta 48720 

gacttccctt tccagcccta caaatgccaa caggagtaca tgaccaaggt cctggaatgt 48780 

ctgcagcagg tagagcacag gccccgagga aaggactgcg ggtgggtgga gcttcagcca 48840 

ggacggggtg tgcttccctc tcccggccca ttccagccag gcccctccgg gccagaggca 48900 

gcgtctgtca taaaaagggc tggtgttcca ggtggggtca gagagaggat tgacaagtaa 48960 

aaacgatcgt cctttgaagg gggccggccc ctccacacct gtgggtattt ctcatcaggc 49020 

gggacgagag actgagaaaa tgaataagac acagagacaa agtatagaga gaaaagtggg 49080 

cccaggggac cggcgctcag catacagagg acctgcaccg gcaccagtct ctgagtttcc 49140 

tcagtattca ttaattacta ttttcactat ctcagcaaga ggaatgcggc aggacagcaa 49200 

ggtgatagtg gggagaaggt cagcaagaaa acgtgagcaa aggaatctgg gtcacaaata 49260 

agttcaaggg aaggtactat gcctggatgt gcacgtaggc tagttttatg cttttctcca 49320 

cccaaacatc tcggtggagt aaagagtaac agagcagcat tgctgccaat atgtctcgcc 49380 

tcctgccaca gggcggcttt tctcctatct cagaattgaa caaatgtaca atcgggtttt 49440 

ataccgaaac attcagttcc caggggcagg caggagacag tggccttcct ctatctcgac 49500 

tgcaagaggc tttcctcttt tactaatcct cagcacagac ccttcacggg tgttgggctg 49560 

ggggactgtc aggtctttcc catcccacga ggccatattt cagactatca catggagaga 49620 

aaccttgggc aatacccggc tttccagggc agaggtccct gcggctttcc gcagtgcatc 49680 

gtgcccctgg tttatcgaga ctggagaatg gcgatgactt ttaccaagca tactgcctgt 49740 

aaacatattg ttaacaaggc atgttctgca cagctctaga tcccttaaac cttgattcca 49800 

tacaacacat gtttctgtga gctcaaggct ggggcaaagt tacagattaa cagcatctta 49860 

gggcaaagca attgttcagg gtacaggtca aaatggagtg tgttatgtct tccctttcta 49920 

catagacaca gtaacagtct gatctctctt ttccctacag tccttgaggg tgacagactt 49980 

aggagtgcct tgggggcctc tctgaggagc agctgatatt cacgggtcag gaggaagcat 50040 

ttccattaga ggggcagccg gtggccagcc tcacttggaa ggtctttgaa cctcgggggt 50100 

gcagggaggt ggcagtggtg caggttgcct tctcctgggt tccttgaggt gccctcttgt 50160 

acccggctca cacccttccc ctccccgagt ttcctgctca ggttcccgtc tgagagcttg 50220 

tatgtaggac gtcagatagg acagcataaa tgtttggatc cagaaacgca gaacagtttc 50280 

ctattttgag acttgacacc taattagtca tcttactatt taagctgaaa aatagtgtcg 50340 

tgttttgggt aacgttctgc aaatcgtttg ctaatggcgg ctgagttgct tcacgccctt 50400 

tagggcaaga gtgggacttg cctgtggact tctccgcggt cccacagggc tctcgccacc 50460 

tggcagtggc ctctgcatct gcaaagagct gcccgctggc tgccgaagct tgtctcaggg 50520 

cagcttgtgt ggcctcgcct cttcctggct tccccgtaac ccttgctccg aactccgttc 50580 

agaaggtgaa tggcatcctg gagagcccta cgggtacagg gaagacgctg tgcctgctgt 50640 

gcaccacgct ggcctggcga gaacacctcc gagacggcat ctctgcccgc aagattgccg 50700 

agagggcgca aggagagctt ttcccggatc gggccttgtc atcctggggc aacgctgctg 50760 

ctgctgctgg agaccccata ggtgacccta gttcccaggc ctctcctggc ctcctgtggg 50820 

gatggttggc aagggatggc gctgagggtg gggtgggccc atggggactc ctgccgtctc 50880 

tcaagcagaa ctcaaggaga attttttagc tgctgtataa tttctcgcca tcgtgggtgt 50940 

aaacctaggg ttgggctttt ttgctgaatt agggcacggc agatgcccac ttcacccatt 51000 

tttgataaac cagtatctgg ggtgtcagat tcttggctgt ctgcagggcc gagttagccg 51060 

aatgccacct gcctttgata cgtgagaacg ttgtctgaga accgtgactt ctgtgcttgc 51120 

ttgtgtctgg tcagcttgct acacggacat cccaaagatt atttacgcct ccaggaccca 51180 

ctcgcaactc acacaggtca tcaacgagct tcggaacacc tcctaccggt gggtcagacg 51240 

agtttacacc tgtctcgggg tcctcaagag aaccagcttg gcatggtgct gagtccacag 51300 

ccccatgctg tgctgtggtg gagggtggtg gtctttctag acgctccccc gaagtgtgca 51360 

gagcgctggt gcccaggggt ggggtgcggc ctgggctgcc tccaatgccc attacttgtg 51420 

aggaagcagc tttgcatctg tgtgctgacc ttgggcgggc gtcctgagct cctcgcaggt 51480 

gctgttgtag cagctgtgca gtaggtcagg gctggccccc agtgcagctt tgcacatgaa 51540 

gtaggaggag gccctgctgc ttgtcagagc ccagcagagt cttggtgttc tgtcgggttc 51600 

ctgtggccgg accagtggca gggtgctgtg gaagctgtcg aatctcctcc ctctgtccag 51660 

tacccccgct cgtcttctag ctccctccta cgcccgggcc acgtttcagt tatgctcact 51720 

tcctctgacc gccgaggctc ctgcgtgtct ccatacagct cacgctgcag ggccacgctg 51780 

tgggtgttgg agacagctcc tcctcgaccc acggtgctct ctcccaccag gcctaaggtg 51840 

tgtgtgctgg gctcccggga gcagctgtgc atccatcctg aggtgaagaa acaagagagt 51900 

aaccatctac aggtaggctc ctgggctccc gctccggctc agtgtccgac aggcgagtgc 51960 

tgctgggtgt ccagagcccc aggctgcgct cccgctgggc tagggtttga agttcactgg 52020 
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gggactgcag gggaggacct ggtgggggtg gggactggct tcggtccttt cttggccgtg 52080 

cttcagctgc gcactctgcc cttcctccca cagatccact tgtgccgtaa gaaggtggca 52140 

agtcgctcct gtcatttcta caacaacgta gaaggtacaa gcagctgggt gggaccaggg 52200 

tcgggttgga gtgtgtgcag cctctcaggg tggagctcag tggtgtcaca gcctggttgt 52260 

gcttgcccgg tggggcggcc agtgcggcca tgtacctggg ccctgtcttc tgactcgggg 52320 

ccacccatgt tagacttctg tgtggaagag ctcacacagt ggtctgagac agccagccgg 52380 

caagactgcc tctggctggt gcctggggcc ttggattttg ggaaggctcc ctccatttcc 52440 

tgatgagagg gtctccctgc acctaacctg ctggtgcaaa cagtaggggt tttgctgaac 52500 

accggctttc tcttcgggga ctttgttgct tgcccagcag caggtgctcc agtgaccggc 52560 

cctcatacca tcttgggagg gtgtcctgga agccgtgtct ggcctcccgc gaccctgccc 52620 

cgtgtgtctt tttcctgtgc tgaccttgct gcggaaaatt atggccctga gtgtgactcc 52680 

aggctgagtc ctgtgggtcc aacacgggat gccttggggc ctcttctgga gacgggatgt 52740 

gagtgacagg agccggccgg ggcagcttgc cctgtgactg cacgtggcca cagcctgtga 52800 

gggccggggg tgcttctcca cccacgtggc tgcccctcgg gtatgtcaag ggcttctggg 52860 

gctcatcacg gggtcctaga gacagtggca gggtgcaccc ccgttggctg cccttacagt 52920 

ttctgtgacc tgagggtggc atctgtgcag tcggcgcggt ctgtgcttct gtgggatcag 52980 

ggttccctct gtttcctgcc tcagttgggg ctcaagcctc aggtgaggtg gccccggagc 53040 

actcagaagg catcggcggt cctgtgggct gctttctgca ctcacgtttg ctgagtgctc 53100 

agtgtgccag gactgaggac cctgaagctg ctcttgtatt tagggcggcg ctcccctggc 53160 

agagactgag ccaggtggtc ccgcatgacc cactaccagg cgtttctggg ccctggccct 53220 

tggagggaca gggtgggcgg aacatgggcc tgcagggagg ctcccgctta ctggaggcat 53280 

gtgctgtgtt gctggagaca tcctctgtgt tgcttcttgt tcgctgtggt ttttggtctg 53340 

gtggcaccaa ggaccctcag tcatcttgat gtgtggttgt ccaggccttt ttgttggtcc 53400 

taagaagggg ctctgccttt gtgcccccag gttccctgac aggagctgcc ggctcgtccc 53460 

ggtgatgcct gcaggacgtg actctgggac ggggggttgg gcagatgtgc tgatggaaat 53520 

tctcaagcag gcgtcatttc cgaggtcctc acctggattt ccaggacagg agtgcctgct 53580 

gggtgtcccc agtcccatgc agcgggggtc cttgggatag catggaacgc tgagcatggg 53640 

cctggccggc cgtggtcctg gacaagggca gtgccccggt ggctgctggg cctgggacct 53700 

ggtggggacg ctgggcctgg tacctggtgg ggatgctggg cctgggacct ggtggggagg 53760 

cctctgactg cctcctggtg ctgcttccgt ctgtgttagg cctctgggta ttggggcccc 53820 

catctgtctc ctcctccagg cctgtggact cagaccagga agacacaggc cagcccctgc 53880 

ctgtccccct tggcttgggc tctcactgcc cgacctggcg ggaggttgcc tagccgtgaa 53940 

ccttcgcacc ctgtctgcca ccggacaggc tgtgaggggg tgtctgcagc acctgcaccg 54000 

gcctgagcat cttcagagtg ggctgcagct cctggagggg tctgagagga agggaggcag 54060 

gtattttggg cgaatgagga gacagctgga gagctggcac ccttcctggc ctgcgtcctg 54120 

tgaggactct ggttggggac agcaagcttg gggtcagcct ggggcagagc ctctgggacg 54180 

gccccgcccc tcgtgcccct tcccctcgca gctcctgtcc tcgccccgcc ctcagctctc 54240 

cgccaggcaa ggtttggcaa gtgccgctgt gcggcagtgc ctgctgattg gctggtctgt 54300 

tgctatggtg ctgcccaggg gtgtgctttt cctcccctgc cttccctgct atccctggga 54360 

gtatctgggg ttgggtcatc gctggtgtgt gtgagtgtgt gtgtgtgtgt atgtgcacgt 54420 

gtgcatatgt gtgcgcttct ggcctctgca gctgagtcct ggccctcggg gggcctggca 54480 

cctcctgggg acaggcacaa agcagccatg atggagtcgg gagctggggg aggccccatt 54540 

gccccacgtg gctgccctgt gactctgggg tgcttgttag aagaggtatc tggttctgtc 54600 

tgtgtttaag caactcccta aggaattctt gtggttccag tttggggggc ctgtactgta 54660 

gaggcaaggg aggggcagga catcccccag actctgactt ctgaagcctt ttctgcccgg 54720 

ggcctctccg ccagtacagg cagtgtcctt tgccagggct gccatgctgc agaggggagt 54780 

gggccactgt ttagcccagg aaaacctggc tctcccttag ctggaagttc tgggcctgtt 54840 

gtggttggca gggaagctga gtgacggtgc taatcacagg ggcacctgca ggggtttgtg 54900 

ggagatgcct ctgtgggttg gggcgatagg ctgaggggct gttcttccct gccctgagga 54960 

gggctgagtg tagccgccac tcctgtcctg tcttgggctg tctcggagag gatgcgtaga 55020 

accctcggga tcctgctggc ctccgtctgg tccaccctga acctcaggcc ttctgggggc 55080 

agaggaggat tccctcagga tcactcgggt gggggcctct cttgggcacc tgagaccctc 55140 

agtgggtgct ttgtggcgcg ttcacggttg gtgggggacg cccagccctg cccgccgtgt 55200 

aggagccgtt ctgtcctggg catccccctg tggtctggga cttagtggac cctgagggtg 55260 

tgtgtttacc cctgcctcac acctgcagaa aaaagcctgg agcaggagct ggccagcccc 55320 

atcctggaca ttgaggactt ggtcaagagc ggaagcaagc acaggtgaga cccctcagtg 55380 

aggccacgac cactgtcctt ccatggccca gctctcctgt gacctgtgga ggcccggata 55440 

tatttcttca cttttctttg ttccttttta aattatgaaa ctaaccacca ttcagtacga 55500 

aaaagtttaa gcagctctga ggaagataga gtaaaaaatt gtctccctct tccctggccc 55560 

tcagccatcc ccggtggcca ccgtggagtg tggacggagc cctgcaggcc tgtgtctgtg 55620 

cggaagcacg cgcagttttg tctgcacaga ctgtcctgca gttggctgtt ttcactcagc 55680 
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gttgtgggta tagcttccca tgctggtgct ggcagctcgg ccttgttctt ttgaggacag 55740 

cagatgtctc ctatgtctac ctcttacagc ttcagagatt caagttataa taaagctctt 55800 

cttatattga gggggaaacc tccctccccc ttttttttga aacagggtct cgctctgcta 55860 

cccaggctgc agtgcagtgt cacagtcttg gctcactgca gcctcagcct cccaggctca 55920 

agcgattttc ccacctcagc ctcccaagta gccgggactg caggcacgca ccaccatgcc 55980 

tggttaattt ttgtattttt tgtacagaca gggtctcact ctgttgctca ggccagtctc 56040 

ctgagctcga gagttccacc tgccttggcc tcccaaagtg ctgggattac aggcgtgaga 56100 

ccccatgcct ggccagctct tttttttttt tttttttttt ttgagacgga gtctcgctct 56160 

gtcgcccagg ctggagtgca gtggtgcgat ctcggctcac tgcaagctcc gcctcccgag 56220 

ttcacgccat tctcctgcct cagcctcccg agtagctggg actacaggtg cccgccacca 56280 

cgtctggcta attttctgta tttttagtag agacggggtt tcaccgtgtt agccaggatg 56340 

gtctcgatct tctgaccttg tgatccgccc acctcggcct cccaaagtgc tgggattaca 56400 

ggagtgagcc accgcgcccg gcccagctct gctttttctt agtggttctg cgttgtgttt 56460 

gtttctatcc aggaataggg ttggttttac ttttccatcg agtttttaaa gagacgacga 56520 

tttacatggt cggaaactca cgaggactcc ccatcccttg gtcggaaact cacatggact 56580 

ccccatccct tggtcagaaa ctcacgtgga ctcccatcca tcccaggcag cagcttccca 56640 

cctgggccct acgtgcagga tgagggctcc ttccgggtca gaagacatgg cggcctcggg 56700 

gcaccgtccc ctgcatgggg tgctcacagg atcttctcct ctctccttcc cagggtgtgc 56760 

ccttactacc tgtcccggaa cctgaagcag caagccgaca tcatattcat gccgtacaat 56820 

tacttgttgg atgccaaggt gggggctcag tcctgtagct gacgactcct gatgtccagg 56880 

ggtgtccctg ggcttgggaa cagctgtccg agcctttgct gcttcagggc cttagatcag 56940 

caggcctggg tgggaggact cacctctgtc actgggcagg ggctcaacct ggccagacac 57000 

acttgtgagc agccccaggc cacaggtcag ttttctgagc agtctgggag cgggcaggct 57060 

ggtgggagtg aggagagacc tccaggctgt ggtccatagg ccagtgcccg ctcttgatcc 57120 

tgacagctca ggttctctcc ttcacgtcag gccatgggag gcaccgagaa cacaggaagc 57180 

ccactgactc ccctcttccc agcgcgtgcc cggccccaca ctcactcccc ctcccagcat 57240 

gtgcccggct tcacactcac tcccctcttc ccagtgcatg cccggcccca cactcactcc 57300 

ccccacagca tgtgcccggc ctgacactca ctcccctcct cccagtgtgt gcccagcccc 57360 

actcccttcc gccccgtgtg cccagcccca cgctcactcc ccccgccagc atgtgcccgg 57420 

ccccacactc aactcccctc ctcccagtgt gtgcccggcc ctgctgccct cctccccatg 57480 

tgccctgctt ttgtgcccca cactttttac ttagtgcagg tgggatcaca cgccacgggt 57540 

caatggtttg Cgtgttcacg tgacgatggc gtggtgacgt ttccagatcc cgtcgttggt 57600 

tcgctcattc tcggggtgta tatttattga gagctcatca tgctgggtgc tattccaggc 57660 

atagcaagac tggcttcact cacatggagc tttgattcta gtggtgggga caggtggaca 57720 

gcaaaagagt aagcacgtga gctgacgata ctgaagggaa atagagcaga gggaggaggc 57780 

ggagaccgag ccaagcgggc ccaagtgcga tgtcggcggg aggtggggaa tgctggtggg 57840 

tctgagggga gcctcagcag gtgcagcaga gcaagggaag aggtgagtgg gggcggctgg 57900 

ggggccgact cctgggaagc tgtagcagaa ccccacagag agctggtgag gtttgccgtg 57960 

gttgtgggtg actcggtgct ttgagccctg gctgcccctg ggaaccatct ggagagcttc 58020 

taacccaacc aggcccctcc ctgggacagt tatatcacag ctggtaagcc gagtctaaca 58080 

ctttcacgga aacgcagaag atctaaaaca gcaagatgac cgtgaagaag aacagagctg 58140 

gaggactcac ctcgctggtt tcaagactcc tctaaagctg caggagtgga ggtggagatg 58200 

gcccagctca ggcacaggcc tgcaggccat ggagaaggca gcaagctcaa gctgacccac 58260 

acgcatgtgg tcattgtttt ttttttcagt tggaatctca ctctgtcacc caggttggag 58320 

tgcagtggca ccatctcggc tcactgcagc ccccgcccct aggttctagc gattctccca 58380 

catcagcctc ccgagtagct gggattacag gcgtgcgcca ccatgcctgg cccttggtga 58440 

ttgttttttg acaaacatgc caatttaatt gagagaggaa atgaaggttg atttctggtt 58500 

ttctgaaaaa atggtgctaa gaacagctgg atatctgttc ggaaaacagt gaatcttaac 58560 

tcttgtttta ccctgtataa acctaaatgt aaaagctaaa ctaaaagtta tagaaaggaa 58620 

catgggggag gtctttgcaa ctttggggta ggcagagatt tcttagtatg gatacacaag 58680 

gcactagcca tgaagaaaaa cattaaaatt tagacttcac caaaatttaa agcttcaact 58740 

ctgtggaaga gttgagaaaa tgaaaaagca gttaaagaaa gggagaaaat acttctttca 58800 

aaggacttaa aaaatttttt cagccctcct ctgatttgaa aggacctttg accagagtat 58860 

gtaaaattct cccataacta agcaaacaac ccacttaacc actgggaagg gatctggaca 58920 

gacgtttcac caagatgggt ggaatggcca gttaaccact gggagagcat ccggacagac 58980 

gtttcgccaa gatgggtgga atggccagtt aaccactggg agagcatccg gacagacgtt 59040 

tcgccaagat gggtggaatg gccagttaac cactgggaga gcatccggac agacgtttcg 59100 

ccaagatggg tggaatggcc agttaaccac tgggagagca tccggacaga cgtttcgcca 59160 

agatgggtgg aatggccagt taaccactgg gagagcatcc ggacagacgt ttcgccaaga 59220 

tgggtggaat ggccagttaa ccactgggag agcatccgga cagacgtttc gccaagatgg 59280 

gtggaatggc cagttaacca ctgggagagc atccggacag acgtttcgcc aagatgggtg 59340 
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gaatggccag ttaaccactg ggagagcatc cggacagacg tttcgccaag atgggtggaa 59400 

tggccagtta accactggga gagcatccgg acagacgttt cgccaagatg ggtggaatgg 59460 

ccagttaacc actgggagag catccggaca gacgtttcgc caagatgggt ggaatggcca 59520 

gttaaccact gggagagcat ccggacagac gtttcaccaa ggtggatgga atgaccagtt 59580 

gagcacatgg aaagtcgccc agcatctcca gtcataggag aaggcagatt aaagccacgg 59640 

ggagccgaca ctgtggtccc actggcatgg ctgaaattca gaagccctga gtgtggcatg 59700 

aggatgtgga acagctggat ctcatccatc gctgtgaagt tgtgtagcca ctccacaaac 59760 

gtgtggcaaa cagccgagcc gggagaaggg aagacgtgtt caaagattca tatgtggcca 59820 

ggctcagtgg ctcacgcctg taatcccaga actttagggg ccaaggctgg gggatcgctt 59880 

aagcccagga gtttgagacc agcctaggca acatagggag accccatctc aaaaaaaaaa 59940 

aaaaagaaaa aagaaaagac ttcagtgtgc aggtttacca gagttttgtt tgcagttgcc 60000 

aaaactggga agcagcccgc gtgagcccat ccacaggtga atggacagac cgtggtaccc 60060 

gaacactaac agcagccacg ggcgtggact gtggtcacac agcagcaggg agccgatgag 60120 

tctcggacat gctaacccag agaggcccat tgaggaggac ctactgtttt ttgtgttttt 60180 

gttttttgtt ttgaaatgga gtctcgctct gtggtgcagg ctggagtgca gtggtgtggt 60240 

cttggctcac tgcagcttcc gcctcttggg ttcaaacagt tctcctgcct cagccttccg 60300 

agtagctggg actacaggca cccgccacca cacccggcta atttttgtat tttcagtaga 60360 

gacggcagtt cgccatgttg gccaggctgg tcccaaactc ctgaccttgt catccactca 60420 

ctttggcctc ccaaagtgct gaggttgcag gcatgaacca ccgcacccgg ctggacctac 60480 

tgttttattc catttatgtg acactctatt aatagaaaag gcaggggtgg ggctggtggt 60540 

tatatggtgc acataactgc cagaactcag tacacttaaa atgaacatct taatgtgtga 60600 

aatttttttt tttgagacgg ggtcttgctc tgtcacccag gctagagtgc agtggtgcga 60660 

tctccactca ctgcaagctc tgcctcctgg gttcacgcca ttctcctgcc tcagcctccc 60720 

gagtagctgg gactacaggc gcccgccacc acgcctggct aatttttttt ttttttttgt 60780 

atttttagta gagacggggt ttcacagtgt tcgccaggct ggtctcgatc tcctgacctc 60840 

gtgatccgcc tgcctcggcc tccgaaagtg ctgggcttgc aggcgtgagc caccatgccc 60900 

ggccaatgtg tgaaaattta aaagtaccaa agctggaccc caccccagat tgctcccatg 60960 

acactctgtg ggtgggacct gggagttggg ttttgttttg ttttgttttg tttttgagat 61020 

gaagtctcac tctgtcgcct aggctggagt gcagtgacac aatctcggct cacattaacc 61080 

tctgcctccc agatgaaagc gattctcctg cctcagcctt ctgagtagct gggattacag 61140 

gcacacacca ccaccccctg ctaatttttg tatttttagt agagacgggg ttttaccatg 61200 

ttggccaggc tggtcttgaa ctcctgacct cgtgatccgc ccgcctcggc ctcccaaagt 61260 

gctgggatta caggcgtgag ccaccgcgcc tggctgggag ttgggtttgt aaatctccct 61320 

gagtggggct ggggcaggga actgctgggt ctgggtcttc ctggctcctc tggtctgtgg 61380 

cttcctgact gcggtggccg ggggctccca gggcatcgtg gccgtctgtc ttgctgagcg 61440 

tggcacgtgc ctttccatgc tgtggaggag cgtctcccgg tatggcgaac tgctggttag 61500 

ggtggggcgg tgttgccagg tcatccaggt ctggcctctg ctctcgacat cgccggcgct 61560 

gttgctcatc tgcgcttgtg atgttcgatg cctgctgcac atgtcttggc ttccctcttt 61620 

cccggcctct gtgagctcca gcgctgcgtc ccttctcttc ctcctgtaga gccgcagagc 61680 

acacaacatt gacctgaagg ggacagtcgt gatctttgac gaagctcaca acgtggtgag 61740 

tctccgctgg cctcctaaac acctcctatt gcttctggcc tttttgtcaa gagccacgca 61800 

aacctttctg gaggggctct ggccaaactc ctgaagccct aggtgcccag gactggggac 61860 

tgagcacacc aggagcttct gccaccccct cccgccctga tccgatgcct ctgctggggc 61920 

tggagactgg ccagctgggc cagggacctg cccgtcaggc gcagggcccc cacaggccgc 61980 

tcaccagacc ctttccctcc agccagctcg gggtcagcct gggccagggc tgtctcctct 62040 

gccctcggca gcagcaggct tgtggtcttg cctgcagtgt ctctgccctt ccggccacat 62100 

ggcttgagac tgaggcagga gaatcgcttg aaccttggag gcagaggctg cagtgagcca 62160 

ggatcacacc actgcattcc agcctgggtg acaaagcggg attctgtgtc aaaaaaaaaa 62220 

atgttgactg ggcgcgctag ctcatgccta taatcccagc actttgggag gctgaggtgg 62280 

gcggatcacg aggtcaagag atcaagacca tcctggccaa catagtgaaa caccgtctct 62340 

actaaaaata caaaaaaatt agctgggcgt ggtggcgtgt gcctatagtc ccagctactc 62400 

aggaggctga ggcaggagaa tcactcgaac ccaggaggta gaggttgcaa tgagccaaga 62460 

tcacaccact gtactccagc ctggtgacag agcaagactc cgtctcaaaa aaaataaaat 62520 

caaaaagaat aattggcaat tccagtgaaa taattgtttg tttgtttgtt gagacagggt 62580 

ctccttctgt cgtccaggct ggagttcagt ggtatgatct tggcccactg caacctccac 62640 

ctcctgggct caagccatcc tcccacctca gcctcccgag tagccgggac tacaggtgca 62700 

caccaccacg cccggctaat ttttgtattt tttgtagagg cggggtttcc cagcgttgcc 62760 

caggctggtc ttgaacccct gagctcaagt gatctgccca ccttggcctc ccaaagtgct 62820 

gggattacag gtgtgagcca ccgcgcccgg cctgaaacaa tcgtttctaa atattggtgt 62880 

gggccacaca gtcatgtttg gacctacttg tggcctttta cagaccccag gccaaggctt 62940 

tgggaacttg gctgtcagcc tcctgtgcct tctgcacccc caccccattt ctgctttctg 63000 
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gaacccccga 
tgtgtgaaga 
tcatagacca 
agttcagcgc 
cctgcgctgc 
gcgttcatag 
tgtggctgtg 
cttctctcag 
ggtattgttc 
acttggcctc 
cgcttgaggc 
aaaaattagc 
gggaggcgtg 
tccagcctgg 
ttcactgtaa 
tctgcctgta 
acggcatatg 
tttctttttt 
gcgcgatctt 
cctcccaagt 
agtagagatg 
cacccgcttc 
tgtccatatt 
cataggttct 
ttcccccatt 
ggggagcccc 
aagttggcat 
tatatatata 
gtggtggtac 
tctcagcgtc 
tgccacgcag 
ctcctctcca 
tccttcgctt 
ccatttcatc 
atcaccacgg 
tgttgtaact 
aacttcagtt 
cacccaggct 
ctccacctct 
ggcgtgtgcc 
gcccaggccg 
cccggtttac 
ccgggctcca 
cttcccatgt 
aagatggcag 
ccctgcacca 
gggggtgggc 
aggcggatca 
tactaaaaat 
tgggaggctg 
atcgcaccat 
tcaatcaagt 
ctccctcctg 
tcttgtgtcc 
ctcatgtcct 
ctgcaagacg 
ggtcgttgtg 
tgttaattct 
tttgggagac 
cagggtgaaa 
ctatagtccc 



tcctgtcctg 
atcggcatcc 
ggtgctggag 
ggactccccc 
tgcagggtga 
ccaggctgct 
tgtggtcggc 
aggcttcttg 
agtagttctg 
tactgtgtaa 
caggagtttg 
caggtgtggt 
cttgagcctg 
gcggcagagt 
gggcattttg 
ggttccttcc 
tggaaaacgt 
ttttttttga 
ggctcactgc 
agctgggatc 
gtttctccgt 
ggcctcccaa 
ttctacatgg 
gctgtgacac 
ttcctttaaa 
gtcctcaccc 
ttgtttagta 
tatacatata 
aatcaaagtt 
ttaaagcgcc 
ggagttgcag 
cagtccccca 
ggcgcagggt 
gcttacataa 
ggccttctcc 
agtgggggac 
ttcatcccta 
ggagagcggt 
tgggctcaag 
actgtgcttg 
tttaccctgc 
tctgaactta 
gatgggacgt 
gaacattgtt 
tgtgctaatt 
tatcagctat 
cgggtgtgct 
cgaggtcaga 
acaaaaaaat 
aggcaggaga 
tgcactccag 
gtcatcactg 
aatctcactc 
tgaccccggc 
gctccagacc 
gtatttcaag 
actgggtgtt 
aaaggtcaaa 
caagactggt 
ccccatctct 
agctagtcgg 



ttctgtggtg 
tttgacctga 
gagcagacca 
agcccaggtg 
gccccacccg 
tggtcctgag 
ctgggcacca 
ggtgtgaatt 
gtattttcca 
aactggaaaa 
agaccagcct 
ggtgtgctcc 
ggaggcagag 
gaggccatct 
catctttaaa 
cagtgtttgt 
ggaaaccctt 
gatggagttt 
aacctccgcc 
acaggcgtgc 
gttggccagg 
agtgctggga 
cttctgtaaa 
tacttgctcg 
atcgcacatg 
ttctcaccag 
cagaagttat 
tttaagtgac 
ccctgtagcc 
gggaccacag 
tgggggaagg 
accccacctc 
ggggagagtg 
cgtgggagaa 
actcttcagt 
gtgtggtggg 
tctgttcccc 
ggtgccatca 
tgatcctcct 
gcagggtcca 
agagtcggaa 
ggtggcttga 
cctttctatc 
gctgctgaac 
ccatcatcta 
gtggtgatcc 
gtctcacacc 
gattgagacc 
tagccaggtg 
atcacttgaa 
cccggcaaca 
aatgtttgtg 
ctgacctcag 
ggttgtggcc 
tggaatcagc 
acccgccgtg 
ttcgtggaga 
cacccggcca 
ggatcacctg 
actaaaaatg 
gagacagaca 



attcgggtgt 
ctccccatga 
aggcagcgca 
cgttcatagc 
gagttcagca 
gcccgtgcta 
tctattcagg 
cttcagggtc 
aagacctatg 
ctctactttg 
gggcaacatg 
tgtggtccaa 
cttccggtgc 
caaaaaaaaa 
tgacccacaa 
ccagaggtgt 
catggatgtt 
cacttttgtt 
tcctgggttc 
accaccacgc 
ctggtctcga 
ttacaggcgt 
cagctgacta 
tgagcatctc 
tggacggaca 
gatggctgct 
caggtgttct 
agggtctcac 
tcggcctcct 
gtgtgcacca 
ggttcctggg 
tctaacgggg 
acaggtctcc 
acatccaccc 
tttgttagtt 
gtgggcttct 
cacccctttg 
cggctcactg 
gcctcggcct 
ttcttttcct 
tctgtacagg 
gggccccagt 
agaaggctca 
acctgaatat 
gtcagttagc 
cattcacaca 
tgtcatccca 
atcctggcca 
tggtggtgca 
cccaggaggc 
gagcgagact 
tgtgaacgtg 
ttgctgcacc 
tctgctgtct 
caagtctcca 
cggcagcggg 
tacagccata 
ggcataaggg 
aggtcaggag 
cgaaaattag 
cgagaattgc 



gcttgggctc 
cctggcttca 
gcagggtgag 
cagactgctt 
cggactcccc 
ctgcagtggg 
ctggcactgc 
ctgtagcctg 
tcttctccca 
tgaagctgag 
gcggaacctc 
gcttttctgg 
cccagatgac 
aaaaaggaaa 
atctggcatg 
atttccacac 
gtcagttggt 
gcccaggctg 
aagcaattct 
ccagctaatt 
actcctgacc 
gagccgccac 
ggagtctgtg 
caggtgtaaa 
ccacggggac 
tggtagagag 
ggctttagaa 
tctgttgccc 
gggctcatgg 
ctgccaccgg 
actttgaacg 
tggacggccg 
ttccctcatc 
acccccaggc 
acttgatgtg 
gccatctcat 
gagatggggt 
cagcctccac 
cccaagctcc 
cacactttat 
aggggcagcc 
tagactgcgg 
cagtatctcc 
gttaatcact 
aggaataact 
ggaaaggtgg 
gcactttggg 
acacggtgaa 
tgcttgtaat 
ggaggttgca 
ccgtctcaaa 
gggattggtc 
ttgaggtgtt 
gggagtcagg 
agaagccctg 
ttggtcatgg 
cgcacaggtg 
ctcagcggta 
tttaagacca 
ccgggcatgg 
ttgaacctgg 



taggagaaga 
ggactggacg 
ccccacccgg 
ggtcctgagg 
cagcccaggt 
cagcctgccc 
agggcatccg 
tggaagggct 
gccagtatca 
ttgggagcat 
gcccctgcca 
aggccgaagt 
tccactgcac 
actaaatata 
catcagctgc 
agcgctagtc 
ctatattttc 
gagtgcaatg 
cctgcctcag 
ttgtattttt 
tcacgtgatc 
gcccggcctt 
tgaatatctt 
cagcatcagc 
cctggaccct 
tgagtttgca 
tccctttata 
aggctggaat 
gatcttcccg 
ctctcaagat 
ctccacctcc 
cctctttcca 
tcggcagctg 
ctgtgtgaac 
cagggctttt 
tcaggaccag 
ctcactctgt 
ctcctgcagc 
tgggactaca 
ttattgaaga 
acacgagttc 
ccaccgtttg 
tttcccgttt 

gggggcttgc 

taggaccacg 
gacaaatgct 
aggcccaggc 
accccgtctc 
cccagctact 
gtgagccgag 
aatcaatcag 
ctgccccatg 
ttctgtgggc 
atttttcaca 
ctttcttttc 
ttactgggtt 
tgttcacaaa 
atcccagcac 
gcctgagcaa 
tggcgcacac 
gacatggagg 



63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
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ttgcagtgag cagagatggc gctgctgcac ccctgcctgg gtgacagagt gacaccctgt 66720 

ctcaaaaatg aatagataaa taaagataaa acacctgctc ctcttggtgt ctccagtttg 66780 

gatttggcct gtgtagcctc ttccttcgcc tgttggtgga tttggcctgc acggattctg 66840 

tgtggcctct tccttcccct gttggtggat ttggcctgca cggattctgt gtggcctctt 66900 

ccttcccctg ttggtggatt tggcctgcac ggattctgtg tggcctcttc cttcccctgt 66960 

tggtggattt ggcctgcacg gattctgtgt ggcctcttcc ttcccctgtt ggtggatttg 67020 

gcctgcacgg attctgtgtg gcctcttcct tcccctgttg gtggatttgg cctgcacgga 67080 

ttctgtgtgg cctcttcctt cccatgttgg tggatttggc ctgcatggat tctgtgtggc 67140 

ctcttccttt ccatgttggt gtcctttttt ccatgccagg aatcctggtt ctcaagggcg 67200 

gggttgttgg cacgagcgtg atgcagactg cctttgctgc ctttctcttg cccagggctg 67260 

aacatggagc tggaagacat tgcaaagctg aagagtaagt gttgccctcc ccgcctcctt 67320 

gcagctgggt ggggcctcct ccttgcgagg aggtgggtga cacctcctcg acccacagtg 67380 

atcctgctgc gcctggaggg ggccatcgat gctgttgagc tgcctggaga cgacagcggt 67440 

gtcaccaagc cagggaggtg agaggcgggg agccagcccc ttcactgcag gcccagccta 67500 

gagctagaaa cgggccatgg tgcagtcctg ggctgtcaca tcacgagtga ggcctgtttt 67560 

caggcctgtt ttcccttttt gagacctggg aggagcacct gctttgcatg atctggttgc 67620 

tgagatgttg agaggagcag cacacactcc cacgggacag cacacagccc cccacggaac 67680 

ggcacacaca cccatggaac agcacacaca ctcccacgaa cagcacacac actcccacga 67740 

acagcacaca cactcccacg gaacagcaca cacacccacg gaacggcaca cacacccacg 67800 

gaacagcaca cacactccca cggaacagca cacacaccca cggaacggca cacactccca 67860 

cggaacagca cactctccca cggaacagca cactctccca cggaacagca cacacactcc 67920 

cacggaacag cacacacacc cacggaacgg cacacactcc cacggaacag cagactctcc 67980 

cacggaacag cacacacact cccacagaca gcacacacac acccacggaa cagcacactc 68040 

tcccacgcgg ggccgctggg tttcctgcag tttctcctcc tccaggcctt tccctggacc 68100 

ctggtccagt ccgtcatttg agcacaggtg cctgttagaa cgagaccttc ttgttaggac 68160 

gatgagtgtc ccagccacca cctcttttgg actccgggag gcctggaacg ttctgaacgc 68220 

tccgtggggc tccagtcttc tccgcagcca gggcagcagg gtttgctgtc tgtcctgcag 68280 

gcagatgagg agtcagggct ggggcctgtg tgggggctct cctgagcgcg cagccgccga 68340 

ggtggagcgt gttctgcctg agcgccgacc tggtcggggg aatcccagtt gcttccaggt 68400 

ggagccactg tcctcagcgt aatgctcaag gctctggcct ggctcctcgg ccaccctgca 68460 

ccctcagggt cccctcctgt agcttctgct gccccatcac tgtcactctc caaagctttg 68520 

gggactctgc ccagagccac cgcctcccag aagcccctga caacctcttg acgaccccct 68580 

agtgacccca tccctcccct ctgacggcgg cccctgctct gaggcggctt cttttcctcg 68640 

gtgctgttct cgtgctggcc aggcctcctc tccccacctg gaggctcctg agggcggagg 68700 

cctctcacct ccaatgctgg cgtcccctgg agggctgaat ttgtttccga gggaaggaaa 68760 

cttccacagt tgttgccttc agttccaaag ctgcagcctg atttccccct ccaggctcga 68820 

gcctgttttc ttctcggcag ctacatcttt gaccagtgtc gtcccccctc aggcccgagc 68880 

ctgccttctt ctcctcagtt cccaaagctg cagtctggtc cccccgccag gctcgagcct 68940 

gccttcttct cctcggcagc tacatctttg agctgtttgc tgaagcccag atcacgtttc 69000 

agaccaaggg ctgcatcctg gactcgctgg accagatcat ccagcacctg gcaggacgtg 69060 

agtgctggca cggggtcttt ggtgcgggca aatgtggcgt agggggtgca gcaggcctcc 6912 0 

atcttggcag tcagggctcc cctggccgtc acctggccgt cagcaggaac aggcccacag 69180 

aacctcatct tctgatcggg gcgtggaggc gttagtgcca cttgccagct gccgtagagc 69240 

ctgtcccagt tctgcagctg gcggcttcgt cctacagcct catcccatta ttctgctttt 69300 

gagaaagagc agcccaaggc cctagctggc ttgtggggcc tctggcttct ccacaccacc 69360 

ccgagttctg cttctcagag ttgtggggtc cagaggcttt gcccagaggc ggtgtcccca 69420 

tgggctgctc tggtttgaga cgccgggccc agcggggtct ctcctctgct gcgctcccgg 69480 

gtgctgggga gggtggcttt tgctgcttca acccttaggc gaccatagag cctcttttca 69540 

agtcccactg acccccttgg agactctgtc cctgcctggc ttctctcctg gctgctggga 69600 

agagcaggcg aactgcccgc cctgaatgga tgctgcgctc caccctgggc cccccattgg 69660 

gcaggagatg gagcttggca gtcgggctga gcgggctcat gctggaaggg ccggggctgg 69720 

ggtcggggcc tcccctgcct gcagtgtggg tgtcagcgcc ctgctgccct ccaggtgctg 69780 

gagtgttcac caacacggcc ggactgcaga agctggcgga cattatccag gtggggcctg 69840 

ctcctctgtg gcatctcctt ccctgatgga agccgggcgg gtgccttctc ctgctgtatt 69900 

agttaactga ttctagactt ggggatggga gaaaggcccc tacaccacct gtttctgatt 69960 

ggcaaactct cggctccttt ccagtgccct aaacccacac tgggcctcct gcagggatgg 70020 

gggaggacga ggtctggtgg cacatgccca gggtgatgct ggtgagggag gacgcaaagg 70080 

acagtggggg ccggggagcc gctcctgccc tgtccgggcc ctcaggccag gggggaccca 70140 

ctgctggcag ccccagcagc cccagctgca cgcagatgaa gagctctgga cacacgcggc 70200 

ttcctgaaca gcttctccag ggacagacaa atggggaccc tgcaggttcc cggcaggggt 70260 

gtccctggga gcccatgatt gggggtgcga ccctggcccc cttctcattg gccccgtcct 70320 
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gccctgcaat gcccgtccca tgtgaggtct gcttctggct ccatgcctat ggcagcacct 70380 

gctttccctg gcgtagaggt gcttgtccgg tttgtggagg gcacgcccca ttttgggtgc 70440 

tctgggcacg ttgcctctcc ggggcctcgg tggctttttt agaagcagac tcagaagtcc 70500 

ctgactgggg aagccaaggc acaggtggct gtgtggagcc ctgtgaggcc tcctctgtgc 70560 

tgcccacgct gtacctgctg gccacacgag atcatggcag ggttaggcag ggctgcccag 70620 

cgctatgaca gcttcatgag tgtccatctg gcctgtgggg tgcttgagct gggggaggcc 70680 

gcagaagaac cctgggatgc atggctggcc tgtgcatgct gctgggcatg gagctgcaga 70740 

tcccggaaca agcaggcact gccttctcct tcacagacgc agctctgagc gggggcgaga 70800 

cctgggcagg gaccaggtgg ggtgggcaca gggtggtggg gcccaggctc agccctccct 70860 

ccactgtggc cgtctctgtg gccagtgacg ccacagcctg tgtcttctct gtgcggtagc 70920 

tggggctgga aggacagcac tgccttgtcc tcccaactcc tccccaaagg cacggtgggc 70980 

atcccaggcc cagacccctc tgtctgtggc tcctgcctgc caagggctgc tgtgctgtcc 71040 

cgcatggagt gtggttggct cttcaagcag gaggccgtgc acctatcagg cggacctgct 71100 

tccatgtccc tgatgggtca ctgcaaagca cctccagcac atggccaggc gaggtagccc 71160 

tgcagcccag ggcctggagg gcaggtgtga gctggcccgg gcctgtccct ccctggaata 71220 

cagcttccca ggctcccact tatggagaag tctcctccac actatggaac tgaatcctag 71280 

aatgtggctt ctgaggttcc tacactcgaa ctgaatcctg gaatgcggct tccaaggctt 71340 

ccagctatgg agaagactcc acactctgga accgaatcct ggaacgcggc ctcccaggcc 71400 

cccagctatg gagaagactc cacactctgg aaccgaatcc tggaacgcgg cctcccaggc 71460 

ccccagctat ggagaagact ccacactctg gaaccggatc ctggaacgcg gcctcccagc 71520 

ctcccactta aggagaagtc tccacactct ggaaccggat cctggaacgt ggcctcccag 71580 

gcccccactt aaggagaaga ctccacactc tggaaccgaa tcctgcacac tccatcggtt 71640 

tggaatttcc tttggctgct gctctaagta gccgctggtg gatgactcag cttctgccag 71700 

ccctcgggtg cctggaggat gagggactgc acacagtgct cacccgcgtt ggctcctgag 71760 

cccctgcagg tgtgggcggt gcccataggg ctggtgctgg gttgggcctg cagccctgag 71820 

tcacaggtga ccctgggggc agagtggggc cagtggcccc aggaagagga tgtgggatgc 71880 

acagctcagc tggaggcgaa ctccaggcag ggtcaggccg tgtgctcgga agtcagggct 71940 

tagctggagg caaactctgg gcagtgctgg cccgtgttgg ggaaccagtt gcccctgggc 72000 

ccccgtgaga ctgctgggtc ctcatccctc tctgcctgag gccggagctg ccctgggctg 72060 

aggcacaggg ggatttgtgg tggtgttttt ttgagaaagg gtctcgcttt gtcaccccgg 72120 

ctggagtgca ggggcttgat cacagctcac tgcagcctca acctcctggg cccaagtgat 72180 

cctcttgcct cagccacccg aggagctgtg aacacaggtg tgcaccaccg cactcagcta 72240 

atttttaaaa tttttttgta gagatgaggt cttgccatgt ttcccaggct ggtctcaaac 72300 

tcctgggctc aggcagtctg cccgccttgg cctcccaaag tgctgggatt acaggcaaga 72360 

gcttccatgc ctgcccagca gaaggctttt cgaaggaagc tgtttcctga ggcagactca 72420 

gccctgctca tggcagccac cagcgtgggg gtgaacttgt tctgttactt ccatccccgt 72480 

gggccaaatg ctttggtaaa acacaaggcc ctgtgtttag ctgtcttgac agtgaaaatg 72540 

gctgggaagg aaggaaggaa cggaaggaaa tttctctctc cttctgtgcg tacccaggca 72600 

cgtgcacatg catgcagagt acgcacacac gcacgcacgc ctgcacaaat ccacgcatgt 72660 

tgccaagtct ctgtgttcca gccgtggtgt ctgccccccg gtgttctcta gttcggcttc 72720 

tccgcatttc tgtgaatgat tccggcttct tggtgttccc agcagaactc cctcaagtct 72780 

gcggcggggc tctgacggcg gtggcttggc tgacatggcc acattgctga gcctgttggg 72840 

ggctttgcgt tcctgttctg gccgtttttg gctcgttttc caggaacggt cgtcacgcgc 72900 

tcctctccta gtgcaggcat cattcctttc ccattgattt gcagggttct ctgtaagttc 72960 

tgaggatccc atatacatat actctctgta agttctgagg atcccatata catattctct 73020 

ctctaagttc tgaggatccc atatacatat tctctctcta agttctgagg atcccatgcc 73080 

gacatacata ttctttcctt gtctcatgct ggtcattttt tccattttca tgacaggttt 73140 

ggtgaacaca tgtttccttg tcagattttt gttctgagct tgtgcctccc gaccaagatg 73200 

ctaaaccggg tcttgtgtat tctccaaact gcactgtaga gtgacggagc tttgtgtctg 73260 

ggcctccatg ccttctgacg tcacctgtgg gggtgtgaaa ggcagactct accttgattt 73320 

ttcccagcac gccacaccgg tggttctgtg cgctgaccga gcggctcggc ttcccccaac 73380 

tccactgggc acctgccaca cttttcctca tgtttttgtt cactgtggtt ttgtcgtaag 73440 

tcctggtgtt ggcctgaacc aatttctttt tgtttgtttt tgagacagag ttttgctctt 73500 

gttgcccagg ctggagtgca gtggcgcgat ctcggctcac tgcaagctcc gcctcccggg 73560 

ttcacgccat tctcctgcct cagcctccca aatacctggg attataggca cctgccacca 73620 

cgcctggcta attttttgta tttttagtag agacgaggtt tcaccgtgtt agccaggatg 73680 

gtctcgatct cctgacctcg tgatccgcct cccaaagtgc tgggattaca ggcatgagcc 73740 

accgtgccca gcctgatatt tttagtagaa atggggtttt gccatgttgg ccaggctggt 73800 

ctcgaactcc tgacctcagg tgatcctctc accttggcct cccagagtgc tgggattacg 73860 

ggtgtgagcc accacgcccg gcctcttgtt cttttgaaac ctgccctgac gttttttcca 73920 

tagtgcatct tggagtcagc gtgtctactt cctgtaaaaa tcttactgtg attttgacta 73980 
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gaatgtgttg aattcctgtt ttttttttga gtcagggtct ctctgttgcc caggctggag 74040 

tgcagtggga ccatcacagc tcactgcagc ctcaacctcc tgggctcagg ggatcctctc 74100 

agctcaacct cccaagtagc tgggaccaca ggcacatgcc accatgcccg gctaggtttt 74160 

tttttttttt tttttggtga acaccctggg gttgcaccat gttgcccagg ctggtctcga 74220 

actcctgggt tcgggcagtt tgctcctctc agcctcccgg agtgctggga ttacaggcct 74280 

gagccactgc actaggccat gttgaatttc tagattaatt tggggccctc aggggcacag 74340 

agaggagggc tgggccagtt ggcgggagga gaggcccctc gggctgccgc attttcagtg 74400 

catggagatg gcctatgttg ggggaacaca gagctcaccg ggggtccctg cagggaggag 74460 

aaagggtcag gcaggtgcca gctcctgtcc attggcctgg ggctgcatga tggcaggggc 74520 

cggtgaaccg atgacccctg ggtgtcctgt gaccttctgt gtatgcggct gatgctgcag 74580 

aaagtcgggt ggcctcaggc tcctgacggg gctgcacttc ctctgccttt cagattgtgt 74640 

tcagtgtgga cccctccgag ggcagccctg gttccccagc agggctgggg gccttacagt 74700 

cctataaggt aggggccacc tccaggaggc aggtggaggg cagcccttgt tccccggcag 74760 

ggctgggggc cttacagtcc tataaggtgg gggccacctc caggaggcag gtggggctgg 74820 

gggtcttctg gtcctaaaag gtaaggggct gcccccagga catgggcggg gcctccacac 74880 

tcctggtcct gtcccctcca ggtgcacatc catcctgatg ctggtcaccg gaggacggct 74940 

cagcggtctg atgcctggag caccactgca gccagaaagc gaggtacaga cctgggccca 75000 

cacgctcccc gcccgcccgg gtgcagtgcc cggcaccacc atgccacagg ctaggcacat 75060 

gcccagccgt ggatctcctg cccccatggg cctggccacc ttctccatat ccaggccaat 75120 

ccagagcatt ctcctcactg tccctctgaa gattggagtt actgagagac gtaggagatg 75180 

gcctgatggc accgtgacct gcccagagtc acctggttgg tggtggcaga gccacagccc 75240 

agccaggcct ccctgctggg acacgctcgt ttatgccgag gccgtcagca cagagcctcc 75300 

acagtgaggc acggctctgc ctgctgcctc cacgcagcgc ctggccgggc caagcctcag 75360 

ggtcacatct gaagggggcc cggctggccc tgttgtccga agcccctggt gcgctcagcc 75420 

ccgaggcccc acgtgccttc ttggcttcct gtgctccgtg gcgtcttcga gtcggtgctg 75480 

ccggggacgc tgtgtggatg gggtctgtga gtgtgccctc ggctccgtgt ccggagccct 75540 

gtggttcttg gggtgtatct ggccccaccc ccactgcgtg gtgtccaggg tggggcttca 75600 

cggctgcagc tgcgggagct gctgcccctg ccttgtgctc cagtggggcc ttgcctctgg 75660 

gcttggttcg tccctctctg gaacattctt tctcagctgc tgtccgaccc atggtggcat 75720 

gacgtggccc tggctgaagc agcccttgtg cggttgctgt ggttgggtct gcctggccga 75780 

gccggaaggg aagggctggg agggcgtcag ggtggcgtgg cttgaccccc gctcggtgat 75840 

ggtcctgcag caaggcctct cccagcagga agcgtccatc ccggggggag gccggcgccc 75900 

ctcacgcagt tggggttgcg ggaggcagtg cgtgcctgag gcagccggtg cacagattcc 75960 

aagggcctgg aatctgtttg ttccattgac ctctgatgtc acttgacttc tcagaagcag 76020 

ccactccctg cactgggcgt ttgtaggaaa tgagctcctg gaggaggggg tggggaagtt 76080 

cccccattgc agggcacact cagccccagg aaggaaacgt gcctcgtccc tgctgactcc 76140 

gaatcgcagt cagagtcgtt ctgcttgtgc cgtgttgaat tcccggcatc cggcatccag 76200 

actcagcctc ctccccaggc cacggccgcc gtggccagtc ggtcaagccc ttctaggaac 76260 

ttcctttgag ctggcgccct tgttcactgc tgacgccact cagaggcttg tgcacgtgtc 76320 

ctgcttccag gcagagctgg gaactcgcac cccgtcttct gcacgcggcc gtggaatgtc 76380 

gggatgccgg cgcttccttc ccgtgtgctc ttggcggggt gggcttcttg ccctgagccg 76440 

catgtcacag tttctgcaga agtttagggt tggagtgggc tgacctctct gcaggtgtcc 76500 

ccagcctctg cctggggtct gcctcctact cccaggaccc cctgtccccc agaggggccc 76560 

caagctggca ggctcacact cagggcagcc tcctttgttc tgacttctgc acagtgggcc 76620 

tgggtggctg cccgcggctc gcttgcttga tgccagtggg tggagagggt gatgggcaga 76680 

gaggcaggtg gtcaggcccc cagtcccgtc ctcacactct gtgccctctg ccgccccccg 76740 

ccccacaggg aaggtgctga gctactggtg cttcagtccc ggccacagca tgcacgagct 76800 

ggtccgccag ggcgtccgct ccctcatcct taccagcggc acgctggccc cggtgtcctc 76860 

ctttgctctg gagatgcaga tgtacgggcc acccctgcca gggcctgagc accggtgaca 76920 

cctctgacat cagcggggtg gaagtggtgg gggtccccat gagccgggtg ctgggggtct 76980 

cgggcctcga gggctaaagg ggtgctggtg cacttcccca ctgtctgctc cctctggcca 77040 

cgctcagccc tttcccagtc tgcctggaga acccacacat catcgacaag caccagatct 77100 

gggtgggggt cgtccccaga ggccccgatg gagcccagtt gagctccgcg tttgacagac 77160 

ggtgagggcc tgtccctggg ccctgctggg gtgggaggtg ggggagcact gaggcctgag 77220 

gccctgagca gtggcctctc cggctctagg ttttccgagg agtgcttatc ctccctgggg 77280 

aaggctctgg gtgagtgccc tgaatgcccc agctgtgcgc atcctggatc ctggacccct 77340 

gctcccaaga gctggtaggg acccctgcag acatcctgcc cctgccttga ccccggcccc 77400 

tgcacttcca ggcaacatcg cccgcgtggt gccctatggg ctcctgatct tcttcccttc 77460 

ctatcctgtc atggagaaga gcctggagtt ctggcgggtg cgtctcccct gtgttctggg 77520 

cggggtgggt gagggcaggg ctggagcatg aagcaggcag tggtcacagc tcctgcttgc 77580 

cctcatcgga tcggcggcgt gaccagggct gccgtgtccc tgcctcttcc tcccacaggc 77640 
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ccgcgacttg gccaggaaga tggaggcgct gaagccgctg tttgtggagc ccaggagcaa 77700 

aggcagcttc tccgaggtcg gcacttggcc ggggctctgg gcctgctgcc ccctcgtgcc 77760 

tcccctgcct ctcacagctt ccccaaggct gaccactggc cctgaccatg ggctccggcg 77820 

gctcccgctg cctcttcagg gctcctgcgt ttccttcctg gccctgagtg ttgcctctta 77880 

tcttacaaag cccccagcac cgggtgggtg tggtaacagt ggccctcctg tctgagtagc 77940 

cctagtcggc caccctggcc ctggggttcc ccgtgttttc tgggaagcac tgagcaggcg 78000 

tggggtcagc ctgggatccg tgccaggaag aagcttccag aacccgattg gccttcctgg 78060 

ctaggacgat ccttcatctt ggagcatgag acctgggtct ccctcatggg ggaggaaggg 78120 

gctggggggg ggctccaggc tcagcctcac caactttcct tccagaccat cagtgcttac 78180 

tatgcaaggg ttgccgcccc tgggtccacc ggcgccacct tcctggcggt ctgccggggc 78240 

aaggtgagct ctccagggcc ctctgccctg acctggttgc ctgttccctg gtgggtgctt 78300 

atggctcccc agcagactct gggccctggg ggctgcccgg tcccctcctt gggtcccacg 78360 

agagcgactg ctggccctgc tgggagcgtg tcctgctctg ggcctgggca ggcaggatgg 78420 

gagtttcctg gccacaagag ttggaggtgg cgtctgggag ctgtggaccc caagtggggt 78480 

cctgacccac agatggagct tcctcccacc cctggttggg gacggagcct cggggaaggt 78540 

ggctgggctg ggtgtgggca ccagggagag gagcccccac ggccccaggc agctccctgg 78600 

tgtgtcccct aggccagcga ggggctggac ttctcagaca cgaatggccg tggtgtgatt 78660 

gtcacgggcc tcccgtaccc cccacgcatg gacccccggg ttgtcctcaa gatgcagttc 78720 

ctggatgaga tgaagggcca gggtggggct gggggccagg tgagttacag cagggtgggg 78780 

ctggggtaag gcggtctggt gactgagccc ccgccccgtg gccaagggag cccccgtgac 78840 

cgagccgcct cgccccacag ttcctctctg ggcaggagtg gtaccggcag caggcgtcca 78900 

gggctgtgaa ccaggccatc gggcgagtga tccggcaccg ccaggactac ggagctgtct 78960 

tcctctgtga ccacaggtgc gtgcagtccg gtggcaggcg cggcgccagg ggacacgccc 79020 

acaccccact gggcccctgg actctccttc cccacatgag gccccgtctc ctccagagcc 79080 

tctccggcta ctcggggtca gcgtggggcc cctgcagcag atgagggtct tcacttcggt 79140 

gaactgaacc cttgaagcgg ctgtgggcag ggcagcaggg ctatggccac cccccaggtt 79200 

cgcctttgcc gacgcaagag cccaactgcc ctcctgggtg cgtccccacg tcagggtgta 79260 

tgacaacttt ggccatgtca tccgagacgt ggcccagttc ttccgtgttg ccgagcgaac 79320 

tgtgagttcc tgcccaggga ggggatgagg gtgttgtccc cagaggagcc agaaatgggt 79380 

ccacccaccc ccatggttct gcagatgcca gcgccggccc cccgggctac agcacccagt 79440 

gtgcgtggag aagatgctgt cagcgaggcc aagtcgcctg gccccttctt ctccaccagg 79500 

aaagctaaga gtctggacct gcatgtcccc agcctgaagc agaggtcctc aggtgcggac 79560 

gggcagcgct gggtgggcgg tgtgggggtg gcggagcggg cggcgtgggg cgggcagcac 79620 

caggcgccca gggcggaggc gactcacctg gctttgtgcg cttcccctcc cacctccaaa 79680 

ggctgcctct ccctcctagg gcagggcccc cacgggctgc aaccctcccc tacaggcaga 79740 

gaacgcccca ggcaaggatg ccccccgagg ctgagactcc ccccaatagc agggaggaca 79800 

cccacaggca ggaccccaag tgctgggact ctcccccaag aggggctttg ccacaggcag 79860 

ggaccccagc tggggccccc cgtgggcttc actgcgcact cgggtgcccc tgcagggtca 79920 

ccagctgccg gggaccccga gagtagcctg tgtgtggagt atgagcagga gccagttcct 79980 

gcccggcaga ggcccagggg gctgctggcc gccctggagc acagcgaaca gcgggcgggg 80040 

agccctggcg aggagcaggt acagttccag ggccttggga tggacacaga ccctctgtct 80100 

cctgaggcca acccgacccc gcccatctgg cctcaggcac ctccccacac acccctgtaa 80160 

atcccctgcc tggcaggcag gcgggcaagc gggcggggga tcccagctgc ctggctgtct 80220 

gtgggtcctc caccccacct cacccacagg ctgctggctc ccaggtggtg catgccctgg 80280 

ccctccgcgg gtgcccccca catcactttg gttctctggc gggtcagctt ggctcagtgc 80340 

actcaaggtc gggtgcccct gccactggct gcgcttgagg ctggcctttc tccaggaatg 80400 

tgctgcgggt ggaacccagg ttccttcttc cttggggcct tttgccccag aagcccataa 80460 

ttcctcaggc caacccgaaa ttttctccct gcttcctgct gggagccatt cccctcttcc 80520 

tgcccatccc tgcccttcag gcccctggag tgagctccag gtgcaggcac caggcacctg 80580 

tgtccccttc ctgccagccc ctcgctgtgg tcggactgtc ttccctggac ctgctcttac 80640 

aagtcaccac ctgcgagcct catgagccgc tggtgtgact tggacaggac caagttgtgg 80700 

cactgtcacc ggggtgtgct gtgcccccct cccccgacct ccatcttggc tcagggctcc 80760 

ttgggaccat cttccctgtg cgtccaggtg ctttgggacc ccagagtgtg tggttggggt 80820 

ctgtgtgtgg ttgtgagctg tgtcctcctc aggcccacag ctgctccacc ctgtccctcc 80880 

tgtctgagaa gaggccggca gaagaaccgc gaggagggag gaagaagatc cggctggtca 80940 

gccacccggt gcgtgagctg tccctgcacc tgtgccgacc accatagaca cgcatgggaa 81000 

cgcagccgtg ggtgccccca gccacggctg gtcccgatgg gaccagggaa tccaccccca 81060 

ggagctgatg tccagggcag ctgtgatgct gacggccagg ggctcaagtg tgtggtttct 81120 

tctgcagggg gctcatgagt cccagctgga atcaggcccc acccttgggc aggtttggca 81180 

tggggcctgc agcactgggc ttggccctgg catttccctc aagtgtggat gcacacctgc 81240 

ctcatgtgag ggacacagcc cattcctagc cttggatcaa agaacggagt tatagccgga 81300 
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gccaggaagc cccctgcctg ctggaaaacc ccaagtgtgg cggcctttgt ccatgtccct 81360 

tggcttctgg gaagaactgg gtggtgccca ggcagggctg gtgccatcag gaagtgggtg 81420 

gctgctgagg ggcctgggct ggcgagggcc tgggtgggga gtgcctgggc cgcccctgcc 81480 

ttggtttcca cgtttccgtg ttggtctggg gtgtgtagag agatgggcac tgctcatccg 81540 

gaagcccctc cttgtgcgct gccatcctgg gagcctcagc cgcatccgct gtggggcagg 81600 

gggcttgagg gaggaggaga gagacgggcc atgcaggacc cctggcttga ggcagagcca 81660 

atctaccctt tgcccattca ctgctctcag ttccctgcca gcctctcact gtgtgacctc 81720 

agacgggccc agccccacag ctttcttccc gcagcccctc cctatgtcca tccagccagc 81780 

cagtttctca ggcagcagcc ccacctcggc agtcactgtc ccagggaacg ctcaatgttc 81840 

caaggaaggc tctgcagccc cagggaccag atgatgaggc tggccctgat ggagcctcgg 81900 

gcctgtgtcc tgcaggagga gcccgtggct ggtgcacaga cggacagggc caagctcttc 81960 

atggtggccg tgaagcagga gttgagccaa gccaactttg ccaccttcac ccaggccctg 82020 

caggactaca agggttccga tgacttcgcc gccctggccg cctgtctcgg ccccctcttt 82080 

gctgaggacc ccaagaagca caacctgctc caaggtgccc tggcttgcag aggccaccca 82140 

ccctgagggc agtgctgccg ccgcgtgtgg ggtgggggcc atctgggtcc aaggtggtct 82200 

ctgttctcta gagaaaaagg ggcagatggg gacagacgcc ccttcctcta caggcttcta 82260 

ccagtttgtg cggccccacc ataagcagca gtttgaggag gtctgtatcc agctgacagg 82320 

acgaggctgt ggctatcggc ctgagcacag cattccccga aggcagcggg cacagccggt 82380 

cctggacccc actggtaaat ggggccccag gtgggaccct cagactcctg cgtggaaggc 82440 

agtgtgggcc agagtcctgg gctgcttggg gtgggcatcc tcgggccctg cttggccccg 82500 

cctctctgtt cccctatggg agtgatgggg gcctccacct ccaccaccag caccagcagc 82560 

accacctcca ccttcaccac caccacctcc accaccacca cctccaccac ctccacctcc 82620 

accacctcca ccacctccac cacctccacc accaccacca cctccaccac caccaccacc 82680 

accacctcca ccaccaccac caccaccacc acctccacct ccaccacctc caccaccacc 82740 

tccacctcca ccaccaccac cacctccacc tccaccacct ccacctccac ctccaccacc 82800 

accacctcca ccaccaccac caccacctcc acctccacca gcagcagcat cacttgttgg 82860 

ggagaccctg tgcaactcca tgcacagccc tgtccctgcc atagccccga cccctaagca 82920 

cagccctgtc caactgccac acgtcccctg cctcccatgc atggtcctgg ggggtcaact 82980 

gcacacgcca gggtcctagg gtcctagacc cctgtcctcc ctgtttctgc ctctgtttgg 83040 

ggtggagtcc aagtctccag aggcggaagc atctgtgttc gtgtgttaat gaacagcccc 83100 

tacagagttc ccctagttca cccagggggg aacctagcct gttgggacga ccccagatcc 83160 

cttctgggct tggtactcac tgggatatcc tcatgcctgc acccagccta cggctctgag 83220 

ctcctgagtg gggctttggc ctgcccgcca ctgttccagc ccccatccag caggctggtg 83280 

tctcctctga tgcccccagc acccaggcgt gtacctgcct gggttttccc gccctggtct 83340 

gaggtgggtg aggcctggcc tccctagcca gccctgcccc cccaccccag ggaactttcc 83400 

agatgctccc gaccagcttt gtggctctac atctcttcat caggaagaac ggcgccggat 83460 

cccaagctga ccgtgtccac ggctgcagcc cagcagctgg acccccaaga gcacctgaac 83520 

cagggcaggc cccacctgtc gcccaggcca cccccaacag gtagctgact cctgaaccgt 83580 

gtgcagccta cgacttggtg ggtccctcag tggcttcacg aggctaactc ttgagtgtgg 83640 

ccggggctgc ccctgtgggg agccatctca tggtggggac tgctcccggt tctgcacccc 83700 

gcagttgtcc tgagcagctc tccaggagtt cctggaggaa gggcgggcag ggcggtggga 83760 

ctctcagtcc tccaccccag cgccactctg agccatgcta ctcccacacc aggagaccct 83820 

ggcagccaac cacagtgggg gtctggagtg cccagagcag ggaagcaggg ccagcacgcc 83880 

gtgagcgcct acctggctga tgcccgcagg gccctggggt ccgcgggctg tagccaactc 83940 

ttggcagcgc tgacagccta taagcaagac gacgacctcg acaaggtgct ggctgtgttg 84000 

gccgccctga ccactgcaaa gccagaggac ttccccctgc tgcacagcaa gtggccctgg 84060 

cgtggggaac agccggtggg gtggggggca ggggacaaaa tgggggctgt gccgggtctg 84120 

attgaagctc cccgcagggt tcagcatgtt tgtgcgtcca caccacaagc agcgcttctc 84180 

acagacgtgc acagacctga ccggccggcc ctacccgggc atggagccac cgggacccca 84240 

ggaggagagg cttgccgtgc ctcctgtgct tacccacagg gctccccaac caggtagggc 84300 

acctgcctgg ctgctcctgg cagcgcccca accgcacgca gccctgggag tgagcagcaa 84360 

agccccaggc ccccctcaga ctcaagtctc tgtctccagg cccctcacgg tccgagaaga 84420 

ccgggaagac ccagagcaag atctcgtcct tccttagaca gaggccagca gggactgtgg 84480 

gggcgggcgg tgaggatgca ggtcccagcc agtcctcagg acctccccac gggcctgcag 84540 

catctgagtg gggtgagcct catgggagag acatcgctgg gcagcaggcc acgggagctc 84600 

cgggcgggcc cctctcagca ggctgtgtgt gccagggctg tggggcagag gacgtggtgc 84660 

ccttccagtg ccctgcctgt gacttccagc gctgccaagc ctgctggcaa cggcaccttc 84720 

aggttggtgc ctggccacta cagttcctgc tgggtgtagc cccaggtgat gggctgaggg 84780 

ggaaagggca ggcccttgtc ctggtggcaa cgcctggcag acgtgtgcag tgggccggtt 84840 

gtctcacagg cctctaggat gtgcccagcc tgccacaccg cctccaggaa gcagagcgtc 84900 

atgcaggtct tctggccaga gccccagtga gtgcccacgg aggcccccag cacacccaac 84960 
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gtggcttgat cacctgcctg tccagctctg gtgggccaag aacccaccca acagaatagg 85020 

ccagcccatg ccagccggct tggcccgctg caggcctcag gcaggcgggg cccatggttg 85080 

gtccctgcgg tgggaccgga tctgggcctg cctctgagaa gccctgagct accttggggt 85140 

ctggggtggg tttctgggaa agtgcttccc cagaacttcc ctggctcctg gcctgtgagt 85200 

ggtgccacag gggcacccca gctgagcccc tcaccgggaa ggaggagacc cccgtgggca 85260 

cgtgtccact tttaatcagg ggacagggct ctctaataaa gctgctggca gtgcccagga 85320 

cggtgtcttc gtggcctggg cttggtggtg ggagttgagg gacagggagt tggcagaggc 85380 

ccctcccagc ctgccatgtg acactgtact tcctccacgg tgggctcagc cctgccctca 85440 

tcctcacagc cgcagccaag ctgcagttgg taggggatcc accgacacac caggctgcct 85500 

gggctggtct ctgggttggg agctgcccca ggtgctgagg agggcagctc cctggctggt 85560 

gaggcccctc ccagaaccac ccttggactg agctctgggg agggatggta ccaggtgggt 85620 

gaggggggct gcctggggag ggaggggttc ctatggggcg tggcgaggct ggcccagccc 85680 

tctccccgcc catatatgta gggcagcagc aggatgggct tctggacttg ggcggcccct 85740 

ccgcaggcgg accgggggca aaggaggtgg catgtcggtc aggcacagca gggtcctgtg 85800 

tccgcgctga gccgcgctct ccctgctcca gcaaggacca tgagggcgct ggaggggcca 85860 

ggcctgtcgc tgctgtgcct ggtgttggcg ctgcctgccc tgctgccggt gccggctgta 85920 

cgcggagtgg cagaaacacc cacctacccc tggcgggacg cagagacagg ggagcggctg 85980 

gtgtgtgccc agtgcccccc aggcaccttt gtgcagcggc cgtgccgccg agacagcccc 86040 

acgacgtgtg gcccgtgtcc accgcgccac tacacgcagt tctggaacta cctggagcgc 86100 

tgccgctact gcaacgtcct ctgcggggag cgtgaggagg aggcacgggc ttgccacgcc 86160 

acccacaacc gcgcctgccg ctgccgcacc ggcttcttcg cgcacgctgg tttctgcttg 86220 

gagcacgcat cgtgtccacc tggtgccggc gtgattgccc cgggtgagag ctgggcgagg 86280 

ggaggggccc ccaggagtgg tggccggagg tgtggcaggg gtcaggttgc tggtcccagc 86340 

cttgcaccct gagctaggac accagttccc ctgaccctgt tcttccctcc tggctgcagg 86400 

cacccccagc cagaacacgc agtgccagcc gtgcccccca ggcaccttct cagccagcag 86460 

ttccagctca gagcagtgcc agccccaccg caactgcacg gccctgggcc tggccctcaa 86520 

tgtgccaggc tcttcctccc atgacacgct gtgcaccagc tgcactggct tccccctcag 86580 

caccagggta ccaggtgagc cagaggcctg agggggcagc acactgcagg ccaggcccac 86640 

ttgtgccctc actcctgccc ctgcacgtgc atctagcctg aggcatgcca gctggctctg 86700 

ggaaggggcc acagtggatt tgaggggtca ggggtccctc cactagatcc/.-icaccaagtc 86760 

tgccctctca ggggtggctg agaatttgga tctgagccag ggcacagcct cccctgggga 86820 

gctctgggaa agtgggcagc aatctcctaa ctgcccgagg ggaaggtggc tggctcctct 86880 

gacacggaga aaccgaggcc tgatggtaac tctcctaact gcctgagagg aaggtggctg 86940 

cctcctctga catggggaaa ccgaggccca atgttaacca ctgttgagaa gtcacagggg 87000 

gaagtgaccc ccttaacatc aagtcaggtc cggtccatct gcaggtccca actcgcccct 87060 

tccgatggcc caggagcccc aagcccttgc ctgggccccc ttgcctcttg cagccaaggt 87120 

ccgagtggcc actcctgccc cctaggcctt tgctccagct ctctgaccga aggctcctgc 87180 

cccttctcca gtccccatcg ttgcactgcc ctctccagca cggctcactg cacagggatt 87240 

tctctctcct gcaaaccccc cgagtggggc ccagaaagca gggtacctgg cagcccccgc 87300 

cagtgtgtgt gggtgaaatg atcggaccgc tgcctcccca ccccactgca ggagctgagg 87360 

agtgtgagcg tgccgtcatc gactttgtgg ctttccagga catctccatc aagaggctgc 87420 

agcggctgct gcaggccctc gaggccccgg agggctgggg tccgacacca agggcgggcc 87480 

gcgcggcctt gcagctgaag ctgcgtcggc ggctcacgga gctcctgggg gcgcaggacg 87540 

gggcgctgct ggtgcggctg ctgcaggcgc tgcgcgtggc caggatgccc gggctggagc 87600 

ggagcgtccg tgagcgcttc ctccctgtgc actgatcctg gccccctctt atttattcta 87660 

catccttggc accccacttg cactgaaaga ggcttttttt taaatagaag aaatgaggtt 87720 

tcttaaagct tatttttata aagctttttc ataaaactgg ttgtagttgc acagctactg 87780 

ggagggcagc cggggacacc tgagccgccc gctgtgccca gatccctcag gctgcctgcc 87840 

atcagaactg ctgcccgggg cttcccctac ctcagacaga ccctccctgg gaggatcagt 87900 

ggggagtgcc acctctgccc ccagtggctg tggcacgtgg caggggcccc tgaagctcag 87960 

cgagggtcag ggcctgggag ggtatcattg ctggaagaac aggatggggc tcaggccagc 88020 

cctagtcgcc ggggcccaca ctaacccccc acttatgaat tcctcccact cccaactcac 88080 

aggggatttc ccgagagggg acctgccaaa gacctcctcc aggcctccca tgcttcccgg 88140 

gaagtgaagc ttctccccct ctggggcagg ctctgaagcc tcccgatgca cccagagcaa 88200 

ccagggggct gcaccagcca ctcgcctccc cagcacggcc aggttcccgg ggctggaggt 88260 

cccccccagg tcctgggaac caacctgcag aacacacaca gggtcccctg gagaggacgc 88320 

ggggacttcc agggcccgac tcctgtgagt cacagccccg cagctgctgc gccaccccca 88380 

ccctgactca tgccccttcc cagcagctcc tcccaggacc ccatgtcctt cccacatccg 88440 

caggaaggga gtgcctggac tctccaggcc cacctgggga gcccctcacc tgcccaccag 88500 

cccctgagca gcccagtaac accatcaccg tgtccaacag ccaggagcct ccaccctcca 88560 

ggagggaagg gatggacaga gccacactcg ccgtctttat tttgcactca ccctgggtga 88620 
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cactgggcag gccgctcctg cccacagcca 
ccagtggggt ccgtgcaggg aggaccccag 
gggccccaca gagaacccct ccgggaggtt 
ccacaaacac tccccaactt gcggggctgg 
cctcagcttt ttgcatcagt cagctccctc 
ccaggcgtgg ggtataggtc ttcccccgca 
aatccagggc ccccgctggc ctttgggaac 
gccccagacc acttccgggg tcacggggtc 
tgacgggcct cctcctcgat ggggcggaga 
ggctgtcctc atgcagccca agccagcctg 
tccctcagac cccccagaaa gggcctcgaa 
ccgctgcagg gcctgggcca gccgggctgc 
acgcttttcc agacatagag gaaagtttgt 
ccaaagcaaa gcaaacccac cccccagatc 
ctccagcacc aggggaccag ccgtcccgac 
tctgccgcgg cggccggtgc acattccgca 
cgcgcacccc tttgctgtga agacagcggg 
ccccgcaggt gcagccccca ctcaccctgt 
gcgcctgccg atcttgctgg tgcagtcgct 
gcacgtctgg gggaggtaag gccgtgagga 
gtgggcccgg gactctcaga agggcgtatg 
caccagggtc ccagtgtctc ccacagagac 
aggctgaaca tggctggtag tgcctgagac 
ccacccccca tcccctccct gggggacacc 
aggcctcctg ggcagtggcg gaggcaggtc 
gaggtgggag ccctgcatca gtgatggggg 
aaccacctgc ccagtcttca aaaagcagcc 
ccctgccccg taccagatgt ccctcttgtc 
aggctggggg caacagagcc accccagaga 
tggacgccgg gctgcttctg tgcaaagcca 
ccagtgctcc cgggggcagt ggcacctcag 
ggctggcaca gatgccacct ccaagggtag 
caccagggcc ccagcatcac ttctttgagc 
tggccacaca ctcctgagac acggccgccc 
agaggcctct gggcgcctgc cctgaggtgg 
tggagaggaa ccaagcttgg gaagctgctg 
taccccctgg gcagtcactg ggcctggggt 
gggggaggct ggggactcgg gaagctgctg 
tcctgcttgt tggccagcac caagacgggg 
ttctctgggg agggcaggag aggcagcgcc 
aggggcccac aggcgtggac actgtgacag 
ctaggcagga gcacttctga ccagacactg 
tggacctgcc cccactcacc atccatccct 
gcctgcttgg actcagccag cctctcctcg 
tgacactccg cataatactg ggaggaagca 
caagggccag cagagccagg cctgtgtcat 
ggcaagaggg cagcccccaa ctgcaagacc 
gatctggtgg aggtgtcccc agacgctgct 
atggtggcca aaggtgaaag acagggattg 
ccaacacttt gggaggcaga agcaggagga 
gcaacacagt gagactccgt ctgtacaaaa 
ctactcggga ggctgaggtg ggaggatggc 
ctctgatcac accactgcac tccagcctgg 
agatagggtt tgggggctgg aggaacctag 
ctgtggggct ctgtgccagc acctcgcaca 
gaccagcacc atctacgaag cgagccttcc 
cagacctctg cagcctgggc tggtcagtcc 
cagggcacag cccctgcagc ctcaggtaca 
ctctatctca atcagaaaaa aacacagcaa 
tcacatttgc cactttgcct ccaggctgtg 
gagacctaga actggatgct gaccaggtag 



gactgaggaa gaacacagca ctcggcaggc 88680 

gaccagcctt actcccgagc aggggacaca 88740 

ctctcctggc tgggggaggg ctctggaccc 88800 

ggcataaaaa cagccactcc cagcaggccc 88860 

ccgggggatt agggtgaggt gaagccaggc 88920 

ggcctcagcc ctgtcccgag gctgcatcac 88980 

atggcctggg tcttcctcaa ggcaagatca 89040 

acagggcaga agccagatgg cagccatggc 89100 

cagccacggg gtctcccgag ggtcccacag 89160 

agcactggag ccccaattcc caaccaggtc 89220 

aggccgccgc tgcgccctgt ggaaaggctg 89280 

cagactcccc tccaaagcct ccggatgcct 89340 

cttcgagaaa acaaagtaaa tagaagaacc 89400 

agcagcatgg gagccaacag gaggccactc 89460 

ggcagcgcgg ctgcgcctac gtgatgtccc 89520 

cgacacactt caccatccac tcgatgccct 89580 

tgtgaggcgg ggggtctcgg tccccaaagc 89640 

gagggccgag caggcctggg tcaggcaatc 89700 

gaaggccgtc ttgatgtcag ggattgagag 89760 

gcagccccca cgtctggccc tgtcctgcct 89820 

cccttcaccc cagggaaaca gccagagctc 89880 

cacagcagtg aggaccctgt gctcagcccg 89940 

aaactagacg tccacacggc tccaaggagt 90000 

ctgagccccg aggtggggcg ctgaggactg 90060 

ccaggggccc acacagccgg ggatgatgga 90120 

cagtctgcag tcatggtggc ttctgctcac 90180 

ctcccctccc cttttcctcc gaggggagac 90240 

ggctgagatt gtaggggagg ccagccttac 90300 

aggcaggaag tgaagattca cccggccctc 90360 

ctccaagaga acagctagaa ctcagcgtgg 90420 

aggggtcttg aggggctgcc ctgggggtgg 90480 

caggaacagg taagggtcag agctgactcc 90540 

tctgagtttc acctgggtgt ccccacagct 90600 

tcctggggag aggtgccctg catagcagga 90660 

gagaacctcc agggctggca gcagcaggtc 90720 

ggggcagggc aggccttgag aatggctctg 90780 

gtctgggtgc acacctactc cccttgctgt 90840 

cgggaggcag gggtggggct cacctccaca 90900 

acaccgcaca gcgcctcgct ggtcaccacc 90960 

tcacacccag catcctgcct ctgactgccc 91020 

ccactccctc tgcccccccc ccgtcaccca 91080 

agcctgcccc aggcacagag ctgcccaagc 91140 

cccagagcag ccaggccgca ctcaccaaac 91200 

tcggtggagt caatgacgta gatgacgccg 91260 

ccaggagttg gggctcagtc cccaccctgc 91320 

ggccacagtg aggggctcac atgaggaagg 91380 

cttctgggat gcattctggg gttgcgggga 91440 

cctgagaacc tgccggcaac ctttggcctg 91500 

ggccaggcgt ggtggctcac acttattatc 91560 

tcacctgagc ccacttcacg gccaacctgg 91620 

gcttatggta atgtgcgcct gcagtcctag 91680 

ttgagcctgg gaggttgagg ctgtagtgag 91740 

gtgagaatga gagaccctgt ctcaaaaaaa 9i800 

accacagcct ggcccgttga gggagtgcac 91860 

gggagggagt gtggccatgc ggataagact 91920 

ctgccaggac agggccagag tcactgagct 91980 

tgggctcgct ggcaacactc ctgggcaaga 92040 

agtgctgagc cctggaccag atgagtgcac 92100 

actccgcgtc cacgtggagc agacaacagc 92160 

ccagctctcc tgtccaggca tgagtgccca 92220 

gacaagctgg tggtcagtgt gttaagacac 92280 
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acacacccga gagcatgaga agccaggagg cacagcccaa ctctccgaaa tccttagggt 92340 

gtctgagcag ggagtaccag acaaccccat cccagtgcca gacaagcttg tgcacctgca 92400 

cttcccacag aggagagaag cctgtgcacc tgcacttccc acagtggaaa ggaggaggcc 92460 

caaggccagg cccccccacc cccaggaact tcccacagtg gagaggaggc ccaaggccag 92520 

gcgccctcca gggttctgca ggtagcgagg cccccccacc cccaggaact tctctggcct 92580 

acagacaggt cccacacaga ggccgccaac ccctcaaggg accctgcagt gtgccggctg 92640 

tctgctgctg acacaaggga gcaggcggac cctaaggtgg agacctctgt ggcaggaggg 92700 

gcggctctgt ggaggctgca gcaagcccag tgagagaatc tccacgtggc tcctggggct 92760 

tctgagcagg gtggcagaag gttcatgtgc aaccgggtcc tggaccatgg gaccacgtgg 92820 

ccagagccac ccatcacacc taccaggcac aaggtgcaca gcccagcagg gccgcagtgg 92880 

acgggagcga cacctcaggg ctgagtgcgg gcaggaccca gagccccacg ccccagtgga 92940 

ggcgtcacag cagtggtcat tgtggggtgc cccacaagga gggggaagag ggaggtgtcc 93000 

cagcgtggct cctggctggc cagctgaccc cagtggagca gtcagaggga ctgtgggtct 93060 

gagtttttct ccccagcagc aatgggagct ccccaactgc aaagtgccag ccagcctgag 93120 

agactagtgt tacagcaaag aacccaggag ctgaggtcct ggcacatgcc acacatgtgg 93180 

acaccaaccc agggtccagc cccaggacga ggccaattcg caatgacgcc cctttctgtg 93240 

gtgctggctc tgcacaagga tgcaggatac aggaaccagg gtgggagcag gggcctccct 93300 

tccggtccct cccagtgacc taggggggtc cctgcagctg atcctcccag ctctgagctc 93360 

agcagggtca ggggtcccgg ccactagagc agcacatact cagcagacac gctgaatgac 93420 

gagccacagc tgcctcatgg gcatgacttg cacctcatgt ctaggagacc ctggtgggca 93480 

ggagatgggg ctgccatccc acagctgtcc cacagctggg gacccaggga gccactggcc 93540 

ccaccacggt ggtgtctgga gaagggctca gactgccagg aagtcgcacc ccagcagaag 93600 

tggtagtgaa ttgggagggc actcaaggaa gggctgtgca gccccaagac cagcagcaag 93660 

gatgggctac agtggccccc ttaagtctcc ctcttccagt ttcgccttaa gagaggccct 93720 

caggaccttg gaggaacccc tctccaacgt ggaagtgtgg gtccacatag ggctgcagct 93780 

gtggccagtg caggcatctc tggccccact gtattcttgc ttcatgttgg agaacactgc 93840 

accagcagat ggtctcattt tggtttctgt gggacccact ttggctgcaa agagccacac 93900 

tgccaggtca cacctgccca gggcagccca cactggggac ccaccaggcc atggtgtgaa 93960 

gtcccggcca gcctggcccc acatggcaca gcatagccag ttctcctcca gggctccctg 94020 

ctgggccaac cacagctctg cggatcctgc tgcctgagtc gacctctcct ctcccgccct 94080 

ccctgccttc ctggtgccga cccccagtgt gcatcctgta cctcgacctg tctcagcatc 94140 

tgtgcctgag acaccggcct gtgacaagat catcatcatc tgtgtcactc cccaagcatg 94200 

ctgcgcactg gacacacagg ccctgactca acttgtcctg tctgacttca gtggtcctac 94260 

aggatctatc agagatcact tggccatggg agaaatgtct tcttggctag aagtcacagc 94320 

aggaggggac actttggggg cgcctaggaa aggggaacta ggatcaaaaa agagatcagg 94380 

acctgggcac tcagctctag agatggcatc agggcagcca aggcactggg gacaccccac 94440 

acccactgtg ccagcctagg gcagggagcc cgaggaagcc acaggctctg ccctgctcag 94500 

tgctggactc agtgcctggc ccaggctgag aaggagataa actgcagcct tgggggtgtg 94560 

gggaaggggc accacactgg gatctcagaa atgcccaaaa cctgtgtcaa aataggagac 94620 

tgccgctgtg agaccctgag gagtcttctg gtgatcatgg aagaacaaat gttaagctag 94680 

aactgaagga acctcatcag gggagaggca gccatcctgc cgtccccaca tctggtcttt 94740 

gccatttctg tgtcctgtgg tggtcagcag caaggtctct gagccgaaag gaggcactca 94800 

ctttggagga gtgcagggtc cccaggtccc cacactttgt cttgtcctga ctgagaaaga 94860 

aacagactgc cctgacctct ctgacttggc cagcgaggtt gcccttaggc tcaaacccaa 94920 

gccagggttt gaacattccc agacacttgt aagatgttta ggttgttaac ataatgttca 94980 

ggtttcaaaa cattgaaaga aactagcccc agccctgaac ccagatcccc cccggcttca 95040 

ggcatgacca gtgaacacgc ccttctctca ctggtcacct gaggatgccg cactctgtca 95100 

acaggttccc ctaatacatg ctctgatctg atcgccttgg catttagtga ttctttccct 95160 

ggaattctcc actggcccca tcgcagggaa ctcccaagtg ggaaactccc ctaccaccac 95220 

ttttggggca acttcagcta agggttcagc tgggacaaaa cagggagcca ctcgggaacc 95280 

tgggacagga ccagagagaa aacccgaggg acagagtggg taaggaaagc tgctgaggaa 95340 

gggcccaaag ggcactctgg aaagaagtgg cactggaggg ctggggtggg ggtggtcctg 95400 

gccagggagt cttaccttgt cccacaaaga ctgcagctct tcctgccctc ctaagtccca 95460 

gaacatgagc cgagcctttc ccacatccac agtgccgact ggggagagga ggaaacaggc 95520 

aaggctcatg accttggtcc tcgacacacc cagtcccagc tctcccaggg gatggggcaa 95580 

accatgctgg tgccactcaa atgagacttg agaggggccc gacagggctg tggccacggg 95640 

ccagctggac tgtgaatatc acggcatcct caaggcccca aacccacagc ctgctattga 95700 

gacccttact gtttaggccc acggtggtgg tgattttgga tagactcatc cccttgtagt 95760 

tcttgttaaa tcgggttttc gactgctcca ggaaggtctg aggagagagg cagaggcgaa 95820 

acacatcaag gaggggctat actggcttcc aaatatcctt actcaggtct gttctttaaa 95880 

agacagaaac agaaacagag caacactctg ctcttcagga ggctggtggt gactatcctg 95940 
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ccgtctcagg tgaaatttgg cttccgtctg ggtagtgaac gtgcagctga cagcacaaaa 96000 

ccgaaggggg cgccgccagg ccgtgggaaa ggtgcgcgca agggcgtggg cactcaccgt 96060 

cttcccagca ttgtccaggc ccaggatcag gatgcagtac tcgtccttct gaaacatgta 96120 

cttgtacaag cccgacagca gcgtgtacat cctgccctgg gcaccccaac ataggtcagt 96180 

gtgcagccag aaagcacctc ccctcccccg ggcttctcca cggtggtcag tggcgcccca 96240 

cgtccagccg accgctcagg acgagagcct gggggccatt cccgactcct cgtccctctc 96300 

ccaccccgtc cctctgtaac ttctcccagg tcagccgcca ctgtgtcctg ctcacagcaa 96360 

tgactgcgac ctctccgcat acacatcggt tccggcccct cccctgctcg cgggactacc 96420 

cagccgggtg ttcacagtga gctcagccgc gctcccgccc tcccccgagg cttcgctccc 96480 

acgcttcacg cgcgcggaac ggggaacaca ctcgctgcag ccccgcctgg gccacggcac 96540 

cctcgagcgc cagccccgcg ccccacccgg gagcagcgag ccaccggcgc gctccccagg 96600 

agcccctgca ggcgccgggt agggacgccc catcacccca tttcttaaaa cggggacggc 96660 

cctgggggga gcggactaca gggcgggtga gcagcggcgc ggctgctcct ggagtgcacc 96720 

tggaggcggc gcgcggctgg cagggaacga ctgcgaagga agaacctggg tcgcggcccc 96780 

cggctacgtc cgccccaagc cgccgccgcc aggtctgagg ctccccgaca agcagccaaa 96840 

gctggctcct gtcacacccg cgtcccacct cgagtcctgg gccgcccctc gggcctcgcg 96900 

cctcaccgca cagcctgcgg cctacctgcg tccgccgcgc cctcggagcc gctgctgctg 96960 

acccccgctg acctccgctg accccgcgct aaccccgcgc ggcgcctgac gggacgcggg 97020 

ccggcctcag ggaatgagct gaaccgcgtc ccagcggcct ccgcgctccg cttcccggct 97080 

gcccccgcgc gccaagcact tccggaagcg gcggcgctcg ggaggaagtg ccgatcggct 97140 

gctggggcga aaagggggcg ccgggccgct ctagccggtg aggccggcgg gctctctgtg 97200 

gctgcggctg ggaaaccgcg cggaggaggt gcccggccgg ggaccaggtg gccgcggttt 97260 

gcggggacgc ggccctggcc agacagaaga gacgccgggc gggggggcgc ggccggcctg 97320 

gaaggcggcg ggcgcggcgg gtgggctcgg cggagggtga ggcggcgggg cgccccgcgg 97380 

ggaaggggct ccggagtgac gcgggacccg gctagcggcg agcccacggc ggctcggaag 97440 

ggaagcgcgg agcctgagcg ggggtacccg ggctgcgacc tctgcgctgg gagctgtgcc 97500 

tctgagccgg tgtctccccg agggaaaggg gacgtgcccg tgcccgtgcc cgccctcagg 97560 

ctgtggggtc ggtcccgaga cgcggggctc agctggcttc tcttcttgca gccctggtcc 97620 

agcgcctccc tctctcagca tggacgagga gagcctggag tcggccttgc agacctaccg 97680 

tgcgcagctg cagcaggtgg agctggcctt gggcgccggc ctggattcgt ctgagcaggc 97740 

tgacctgcgc cagctgcagg gggacctgaa ggagctcatc gagctcaccg aggccagcct 97800 

ggtgtctgtc aggaagagca ggttgttggc cgcgctggac gaagagcgcc cgggccgcca 97860 

ggaagatgct gagtaccagg ctttccggga ggccatcact gaggcggtgg aggcaccagc 97920 

agcggcccgt gggtccggat cagagaccgt tcctaaagca gaggcggggc cagaatctgc 97980 

ggcaggtggg caggaggagg aagagggaga ggacgaggaa gagctgagtg ggacaaaggt 98040 

gagcgcgccc tactacagct cctggggcac tctggagtat cacaacgcca tggtggtggg 98100 

aacggaagag gcggaggatg gctcggcggg tgtccgtgtg ctttacctgt accccactca 98160 

caagtctctg aagccgtgcc cgttcttcct ggagggaaag tgccgcttta aggagaactg 98220 

caggtaaagc cctttgttgt cagatgccaa ccttaggggc gtaaggggca cgcacacagg 98280 

gtcgggtcag gatcggccct ccctttgctt tgcagttttg tctcagcttc ctggggcagg 98340 

cgtgctttga cagctgtgtc tgtgttcagg cgtctacgtc ttccttctgg ggtgaatcaa 98400 

gaagcatgga aggaggccag gcgcggtggc tcacgcctgt aatcccagca ctttaggaag 98460 

ccgaggcggg cagatcacct gaggtcagga gttcaagacc acgctggtca acatggtgaa 98520 

accccatctc cttaaaaaca caaaaatgaa ccggtcgtgg tggcgcgcac ctgtggtcct 98580 

ggctactcag gaggctgagg caggagaatt ggttgaaccc aggaggccga gtttgcagtg 98640 

agtggagatg cagccactgt actgcagccc gagcagcagt gcaaggctta tgtggaagag 98700 

agtaggtctc cagcctatcg tcagtttttt tttggtggtt gttttaattt tttttgagac 98760 

agggtcttac tttgtcaacc aggctggagt gcagtggcat agtcctggct cactgcagcc 98820 

tggacctcct gggctcaacc gatcctcctg cctcagcccc cctaggagct gggctacaga 98880 

ctcacgctac tacacccagc taatttttat attactataa ttttttatct tttttttgag 98940 

acggagtctt gttctgttgc ccaggctgga gtgcagtggc gtgatctcgg ctcactgcaa 99000 

gctccgcctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag ctgggactac 99060 

aggcgcccgc caccatgtct ggctaatttt ctgtattttt agtagagacg gggtttcacc 99120 

atgttagcca ggatggtctc aatctcctga cctcgtgatc cgcccacctt ggcctcccaa 99180 

agtgctggga tgacaagcgt gagccaccgc gcctggcctt ttttttttgg agacagagtt 99240 

tcactctcct cacccaggct ggagtgtagt ggcgcaatct cagcttaccg caacctctgt 99300 

ctcccgggtt gaagtaattc tctacctcag cgtccagagt agctggcatt acaggcgccc 99360 

gccaccacac tcggctaatt ttttgtattt ttagtagagt cggagattca ccatcttggc 99420 

caggctggtc ttgaactcct gacctcgtga tccacccacc ttggcctccc aaagtgctgg 99480 

gatcacaggc gtgagccact gcgcctggcc ctgttgttag ttttattctc tagagttcaa 99540 

cttttaaatt ttactttcat ggagattttc aaacataccc caaattagag agtttagcat 99600 
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aatcaccgcc 
tttgtctgtt 
aagttactgg 
aggaaattac 
gtaacttctg 
atgaacaatt 
tttttacttt 
tagccaatag 
gtcttccttt 
gcagtggcac 
cctcagcctc 
tattttttgt 
gatgagagat 
ttcatatact 
ctgacagagg 
ggctccagtt 
tcggcatcag 
attcaggact 
agtttcactg 
agggattctc 
gtgcccagct 
gtctcgaacc 
gcatgagcca 
aagtatatag 
tgtcctttga 
gcctatgttt 
cctatgtttt 
tttgagttaa 
atttccagtt 
tggcactttt 
tcctgttcca 
ataagtttta 
ttttgtttct 
cataaaagac 
acattgtcat 
cccagtactt 
ctggccaacg 
gtgcacacct 
gaggaggagg 
gagactccat 
tcatgaatga 
actgttcaag 
aattgttttt 
agtgcagtgg 
ctgctcagcc 
tttttttgag 
ctcactgcaa 
ctgggaatac 
gggtttcacc 
cggcctccca 
ccttagttac 
ggctcacgcc 
ggagttcgag 
ttagtctggc 
aattgcttga 
gcttggggaa 
tttttatgtt 
atgcactatt 
tatatgtgaa 
ttcttgctta 
cgggccgggg 



cacggtccat 
ctccctgccc 
ttttttccaa 
ttaaaaagta 
tgtcataata 
attactgtaa 
tctcttagaa 
ttcaggctga 
tttttttttt 
aatctcggct 
cctaatagct 
agagacggga 
ctgacacatc 
ttcattcatc 
aggaaacagg 
gcccccagcc 
gcctgatctg 
gataagttct 
ttgttgccca 
ctgcctcagc 
aatttttata 
cttgacctca 
ccacacccgg 
tttgcagata 
tgcacaaaca 
ttgtcatcag 
cttctaagag 
tttttgtata 
ttcccagcac 
gtccaaaatc 
ttggtgtata 
aactcaggaa 
ggatcctttg 
agtttgaatt 
ctttacaata 
tgggaggctg 
tggtgaaacc 
gtaatcccag 
ttgcagtgag 
cttaaacaac 
aggatctgtc 
tcttgcacat 
cttttccttt 
cacaatcttg 
tcccaagtag 
atggagtctt 
gctccacctc 
aggtgcctgc 
atgtagccag 
aagtgctggg 
attttcaggc 
tgtaatccca 
accagcctgg 
atggtggcag 
acctgggagg 
caagagtgag 
aatcctgtat 
taggattttc 
tagagataat 
actgccttag 
gtggtggctc 



catccaatgt 
tatttttttc 
ttatgaatac 
actttttgga 
ggtaacacat 
tgtctctatt 
gaaatattta 
ccgtgtaacc 
ttgaaacagc 
cactgcaatc 
gatactacaa 
tttcaccata 
cctgtggtgc 
tgtgcaacag 
agtagaagaa 
ctcgttccgt 
aaagcttccg 
cttggatttg 
ggctagagta 
ctcccaagta 
tttttagtag 
ggtgatcccg 
cccaggattt 
tttgctccca 
ttttaaattt 
gtttaagaaa 
ttctgtagtt 
tgttattagg 
catttgttta 
aattggcaat 
tgtgccagta 
gtggtagtta 
caatttcata 
ttgatatgga 
ttaagccttc 
aggcgggcat 
ccgtctctac 
ctacaggaga 
ctgagatcat 
aacaataaca 
catttattta 
tcttggttaa 
ttttttttga 
gctcactgca 
ctgggatcac 
gctctgttgc 
ccgggttcac 
caccacgccc 
gatggtctcg 
attacagatg 
tgtttgttgc 
gcactttggg 
ccaacatggt 
gtgcctgtaa 
cggaggttgc 
acttcatctc 
tgattgctga 
gatatataca 
tgtagtcttt 
ctagaacttc 
acacctgtgt 



cgtcatttat 
tttcctgggc 
ttcatagttg 
agtccaagtc 
ttaatggtaa 
tccctaagcg 
ccaagccttc 
atccctagtt 
gtcttgctgc 
tccgcctccc 
gtgtgcactg 
ttacccaggt 
tccctctgga 
ccctgtaggt 
ctgagtggtc 
cccctgctcc 
gagcatctta 
cgttggacct 
caatggcacg 
gctgggatta 
aggcagggtt 
ccttggtttc 
ctttatatat 
ttgtttgggt 
tgatgcagtg 
ccacctcatc 
ttagctgtta 
tgagggtcca 
aaagactgct 
atatgtaagg 
ccacactgtt 
ttcaccattt 
tgaattttag 
ttgcatagaa 
tggctgggtg 
atcacttgag 
taaaaataaa 
gggtgaggca 
gccactgcac 
gaagaaaaaa 
cgtctttaat 
ataagtatta 
gacagagtct 
acctctgcct 
aggtgcctgc 
ccaggctgga 
accattcttc 
agctaatttt 
aactcttgac 
tgagccactg 
tagtatatag 
aggccaaggc 
gaaatccagc 
tcccatctac 
agtgagctga 
aaaaaaaaaa 
attggtttat 
atcatatata 
gtttctagtt 
aagtacgatg 
tcccagcact 



taatattttc 
cattttaaag 
catctctaat 
cgatgtgagg 
tacctcttcc 
catagcttta 
tagtaggtaa 
ctagttctag 
tctgtcaccc 
tggcccaagc 
ccacgcccag 
ctcgaattcc 
ccaggcactg 
aggccctgca 
cagggcttca 
acccagtgct 
cagacgtcca 
tttttttttt 
acctccacct 
caggcgcctg 
tcaccgtgtt 
ccaaagtgct 
tctggatatc 
tgtcttttca 
caatttattg 
catagttatg 
aatttaggtc 
ctttattctt 
ttttctccac 
gtttatttct 
ctgattatta 
gctcctcttt 
gatcggcttg 
tgtgtagatc 
tggtggctga 
gtcaggagtt 
aaacaaatta 
ggagaatcgc 
tccagcctgg 
acagtattaa 
ttctttcaac 
tttttgatgc 
tgctctgtca 
cccgggttca 
caccacaccc 
gtgaagtggc 
cgcctcagcc 
ttgtattttt 
ctcaggtgat 
tgcccggctc 
aaatacaagc 
gggtggatca 
ctctattaaa 
tcaggaggct 
gatcgcgcca 
aaaaagaaat 
tagtgctaat 
ttcaatatat 
tgcatggcat 
ttgaataaaa 
ttgggaggtg 



ccagtctcat 
caaattccag 
ctaactgatt 
acaaaaaaga 
atattcaaat 
tttttcctcc 
ttttcttttt 
ttctttgaat 
aggctggagt 
catcctccca 
ctaatttttg 
tgatcccttt 
ctccaagggt 
gtcacaccat 
aggctcagag 
gcttgccatg 
ccttgccacc 
tttaagatgg 
cctgggttca 
tcaccacgtg 
ggccaggctg 
gggattacag 
atcccttatg 
cttgatatag 
tttctttatt 
aggattttca 
tttgatccat 
ttgcatgtgg 
tgaatggtct 
gagctctctc 
tagctttgtg 
ttcaagtttg 
tccaattctg 
tgtttggggc 
cgcctgtaat 
caagaccagc 
gtcggaggtg 
ttgaacctgg 
gtaacagagg 
gtcttccaat 
agtattttgt 
ttctctaagg 
cccaggctgg 
agcaattctt 
agctaatttt 
ccaatcttgg 
tcctgagtcg 
agtagagatg 
ctgcctgcct 
gagttgtttt 
tgggcaccgt 
cctgtggtca 
aatacaaaaa 
gaggcaagag 
ttgcactcca 
acagtggatt 
aggatttttt 
acaattaata 
ttatttcttt 
gtgactagag 
gaagtgggca 



99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 



37 



WO 01/42434 



PCT/US00/33065 



gatcacttga gatcagcagt ttgagaccag cctggccaac acggcgaaac cccatctcta 103320 

ctaaaaatac aaaaattagc tgggtgaggt gatgtgcacc tgtagtccca gctacttgag 103380 

agggtgagac atgagaattg cttgaacctg gggggcggag gttgcagtga gccaagatca 103440 

tgccactcca ctccagcctg gacgacagag caagaaccct gtctttaaaa aaaaaaaaaa 103500 

aaaagtggct agaacaaaca tctttatctt gttcctgatc ttaggtggaa aacttttttg 103560 

ttcctgatat taggtggaaa acttttagtc tttcactgtt gaatatgatg ttacttgtag 103620 

gttttctgta gattcccttt atcgagttga ggaaattctc ttatattcat agtgtgttga 103680 

gtgtttttta tcatgaaagg gtgttgattt tttttttaaa gatagggtct tgttctgtca 103740 

cccaggctgg agggcagtgg catgatcatg gctcactgca acctcgaatt cctgggctca 103800 

ggggatcctc ctacttcatc ctcctgagta ggtgagacta caggcatgag ccaccatgcc 103860 

cagctaattt tttaattttt ctgtagaggt agggtcctgc tttgctgccc aggctggtct 103920 

taaactccag ggctcaagca atcctgcctc agcctcccaa agtgctgaga ttacaggggt 103980 

gagtcactgc actgcaccca gctgtgtggg atttttcaaa tgcttttttc ctttagatga 104040 

tcatgtgtgg tttttttcct ttcattttgt taatgtggta tattgatttt cgtatgttga 104100 

accatccttg aattcctcag ataaagcacg catattcatg gcgtattatc tctttattat 104160 

tatttttttt gtagagatga gatttcactc tgttgcccaa gctggtctca aactcctggg 104220 

ctaaagtgat cctcctgcct cagcctccga aagcgctggg attataggca tgagccactt 104280 

ggccctatct tttttctttt tctttttttt ttttttttga gacagagtct cactctgtcg 104340 

ccgggctgga gtgagtggcg cgatctcggc tcactgcaac ctccatctcc cgggttcaag 104400 

caattctcct gcctcagcct cctgagtagc tgggactaca ggtgcccgcc actatgccca 104460 

gctaattttt tgtgttttta gttgagacgg tgttttgcca tgttggacag gctggtcttg 104520 

cactcctgac ctcgtgattc acccaccttg gcctcccgaa gtgctgggat tacaggcatg 104580 

agccaccgca gcgagcctta tctttttaac agttaaaagt ttaaggcctt atcatgtaat 104640 

aacattgctg gatttgattt gctgctgttt tgttgagaat atttgcatct gtattgataa 104700 

gggatattgg tctgtagttt tcttttcttg gcatgtcttt gtatagcttt gatgccagca 104760 

taatattggc ctcatagaat gagttaggaa gtattcttta tattatggga agaggtaaaa 104820 

agggattggt gttaattctt cttcaaatgt ttgatagaat tcaacagtga agtgatatat 104880 

acaatcatat atatagagag agagagagag agagatggac ttttcttttg ttggaagttt 104940 

attgactatt gattcaattt ccttattgaa attgactttt ctttttggaa gctaaaatgt 105000 

ataactgtag tgaaagtttc tgaacttttc tttcattgga agttttttga ctactgattc 105060 

tttatttgtt ataggtctat tcagattttc tgtttcttct tgagtcagtt tggtctcgct 105120 

ctgtcgccca ggctggagtg cagtggtgcc atcttggctc actgcaactt ctacctcccg 105180 

agttcaagtg attctcccac ctcagcctcc ccagtatctc ggactacagg cgcacgccag 105240 

catacctggc taatttttgt atttttagta ggaacagcat ttcaccatgt tggccaggct 105300 

ggtctcgaac tcctgacctc aggtgatcca cccgcctcgg cctcacaaag tgctgggact 105360 

acagacataa gccaccgcgt ccagccttga gtcagtttag atagtttgca tgcatgtttc 105420 

taggaatttg tccattttgt ttatgttatc taatctgtta ccatacaatt gttcatagta 105480 

tccttttata gccctagtta tttctgtaag atcagtagta atagctccac tttctctctt 105540 

ggttttagca atttgagtca tctcttttct tcttcttttt ttttttttga gatggagtct 105600 

cactgtgtca cccaggctgg agtgcagtgg catgatcttg gctcactgca acccctgcct 105660 

cccaggttca agcaattctg ccttagcctc ctgagtagct gggattacag gtgtgagcca 105720 

ccacacccag ctagttttgt tttgtttttt tgtttttgag acggagtctg tttctgtctc 105780 

ccaggctgga gtgcagtggt gcaatctcac tcattgcaac ctccgactcc cagattccag 105840 

caattctcct gcctcagcct cccgagtagc tggaactata ggcgtgcacc accacgcctg 105900 

gctgattttt atatttttag tagagatggg atttcaccat gttggccagg ctggtcttgg 105960 

actccctacc tgaggtgatc cgcccacctt ggcctcccaa agtgctggga ttataggcat 106020 

gagccaccat gcccagccag tttttgtatt tttagtagag atggggtttc tccctgtcgg 106080 

ccaggctggt cttgaaatcc tgacctcagg ttatccacca gccttggcct cccaaagtgc 106140 

taggattaca ggcatgagcc accacgcatg gcctgtcttt tcttcttggt cattttcgct 106200 

aaaggtttgt caattttgtt gatctttttt gttgctgatc tctattgttt tcccattctg 106260 

tttcatttat ttccatttta acctttgttt ccttttttct gctggtttgg gtttaatttg 106320 

ctcttttttt cccctaattt ttcaaggtat acagttaagt tattgatttg agatctcttt 106380 

tttcttttct tttttttttt tttttttttt tttggttgct gttgagatgg agtctccctc 106440 

tgtcacccag actggagtgc agtggcatga tctcagctca ctgcagcctc cgccgcccag 106500 

gcgattctcc tgcctcagcc tcctgagtag acgtttcccg gccaaggtgt ttctttttga 106560 

atgtaagcat ttacagctac agatttccct ctaaacactg ctttcactgc attccataag 106620 

attgtttttt gttgtttttt gttgttgttt tgttgtttga gacacagtct cactctgttg 106680 

ccgtttggag agcagcgatg cgatcatagc tctgtagcct tgagctcctg gactcaatca 106740 

gtcctcctgc ctcagcctcc caagtagctg ggactacagg tgtacaccac tgcacctaac 106800 

taatttcttt tataagtttt tgcagaggcc aggcacagtg gctcacacct gtaatcccag 106860 

cactttggga ggccaaggtg ggtggatcac ctaaggtcag gagttcgaga ccagcctggc 106920 
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cgacagggag 
tgcctgtaat 
agaggttgca 
ctccatctca 
caggctggtc 
gagattatgg 
cattttcatt 
ttattcaaga 
tttttttttt 
cttgcctcac 
agtagctggg 
acgtgctggt 
ccaaagcact 
atagagagaa 
tccctcctcg 
tttcaagact 
cactttggga 
acatggtgaa 
gcctgtagtc 
gagcttgcag 
ccgtctcaaa 
tgagaaattg 
ttgctgtttt 
gtggatccct 
atgtctttca 
tttgtttctt 
tggtgtagca 
tcctcatttt 
ttctcaaatc 
gctccaaagt 
tttggttaga 
ttaaatttct 
gtcacccagg 
ttcaagtgat 
cacccaccta 
gtcttaaact 
cagatgtgag 
gtttttgtct 
tatttttaaa 
ctaggctgga 
gcgattctcc 
ggctaatttt 
gatctcgtga 
gagccaccgc 
gctgtagtgc 
gtctcccacc 
ggttgtgttg 
ctcctgggct 
agccaccgca 
ttctttgcat 
ttactttgaa 
ttgtcgtttc 
tcaggggtgc 
tctttaaata 
atgctaggac 
gcctggccag 
tggcacctgc 
agggggcgga 
gcgagactct 
ccaggttgcc 
cctcagcctt 



aaaccccatc 
cccagctact 
gtgagccagg 
agaaaagaaa 
tcaaactcct 
gcaagagcca 
gatctcaaga 
gtgtctttat 
gagaaagggt 
tgcaacctct 
attacaggca 
cagactggtc 
gggattacag 
ggtctcactc 
gcctcccaag 
ttaaaaatgc 
ggccgaggtg 
accctgtctc 
ccagctactc 
tgagctgaga 
aaaaaaaaaa 
gctgttaatc 
aggagattct 
caatttataa 
ttaaatttgc 
tctgtctgtt 
caggtccctt 
ggataaattc 
tgctgttgaa 
ttctatttgg 
catggttctt 
ttttatttag 
ctggcaggca 
tctcctgcct 
atttttggta 
actaacctca 
ccactgtgcc 
agtcattcaa 
aaattttttt 
gtgcaatggc 
tgcttcagcc 
ttgtattttc 
cctcatgatc 
gcccagccta 
agtggcacaa 
ttagtctcct 
ttactgtttt 
caagcagtca 
cccggcctgt 
gccttgtaat 
aatcagatcc 
tcgtttgttt 
ttgtttctgt 
tctagtggca 
tttgggaagt 
tatggtgaaa 
ctgtagtccc 
ggttgcagtg 
gtctcaaata 
tctgtactgg 
tacttctcgc 



tctactaaaa 
caggaggctg 
atcacaccat 
aaaaaaagtt 
gacttgaagg 
ccgcaccctg 
catcctaatc 
ttctgcatat 
cttgctctgt 
gcctcccggg 
cacaccacca 
tcgaattcct 
gcgtgaaaca 
aagcccaggc 
gtgctgagat 
catcttggct 
ggcagatcac 
tactaaaaat 
gggaagctga 
tcacaccact 
aaaatgtcat 
ttattgagga 
ctatctttgg 
gctacttgga 
aaagtttcag 
tgccgcttat 
gggctctagg 
agctgacctg 
acttctggtg 
tttctttctg 
ttgttttcct 
ttattaattt 
acgtcacaat 
cagcctccca 
tttttagtag 
ggtgatctgt 
cagcctcttt 
atgtctaggc 
taattttctt 
ttgatcttgg 
tcctgagtag 
agtagagaca 
ctcctgcctc 
ttttttattt 
ccctggcaca 
gagtagctag 
agacacaggg 
tcccaccttg 
taatttcttt 
tttttgttga 
tccgccctct 
agttactttc 
tcttttaggt 
aaaaggataa 
ggaggcgggt 
ccctgtctct 
agctactggg 
agctgagatt 
aaaaaaaaaa 
ggcaagtcct 
tcccatggag 



atacaaaaat 
aggcaggaga 
tgcactccag 
tttgcagaga 
agtcctactg 
ccacttggct 
tcccttttgt 
ttgtaaattt 
cgcccaggct 
ttcaagtgat 
cacctggcta 
gacctcaggt 
ccatgcccag 
tggtcttgaa 
tacaggtgtg 
gggcacggtg 
gaggtcagga 
acaaaaaatt 
agcaggagaa 
gtactccagc 
ctcactgcct 
acatttatat 
gtttcagcag 
gttcattgga 
ctactattct 
ggaactttat 
catttttctt 
tcctcaagtt 
aaattttcac 
tagtaattat 
ttagttcatt 
tttttttttt 
cttggctcac 
agtagctggg 
aaactgggtt 
ccgcctcagc 
ttttagtgta 
ttcctcagga 
ttttttttag 
ctcactgcaa 
ctgggactat 
tggttttgcc 
ggcctcccaa 
tttgagacaa 
ctgcagcctt 
aactacaagc 
tcttgctaca 
gcctcccaag 
atttccggtg 
aacctgcaca 
gcagggttca 
ctgacctttt 
tagtggttag 
agaggccggg 
ggatcacttg 
actaaaaata 
aagactgagg 
gcgccattgc 
aaaaaggata 
tcagtgtccg 
cctaaggatg 



tagctgggcg 
atcgcttgaa 
cctgggtaac 
cagggtatca 
cctcagcctc 
gttttgttct 
ttttttgttc 
tccaaaaaag 
ggagaatggt 
cctcccacct 
atttttgtat 
gatctgcccg 
cccccaattt 
ctcctgagct 
agtcacagta 
gctcacgcct 
gatcaagacc 
aaccaggtgt 
tggcgtgaac 
ctgggcaaca 
tctggtccaa 
attgactagt 
tttgattata 
cttcttggat 
ttgcatcttg 
gcatacattg 
tgttcttttt 
cactgtttct 
tacagttact 
cactttacta 
atccatggtt 
tgaagcgggg 
tacaacctcc 
attataggca 
tcaccacatt 
ctcccaaaat 
tttaaggtaa 
acagtttcta 
atggagtctc 
cctctgcctc 
aggtgcgtgc 
gtgttagcca 
agtgctggaa 
agtctccctc 
aaccgtccag 
atgtgccacc 
tttctctgac 
gtgttgagat 
aatgggccac 
atttgaagat 
ttgttgctgt 
taaataaaga 
cttgtgcttt 
cgcagtggct 
aggtcaggag 
caaaaattaa 
caggagaatc 
actccagcct 
aagagtgtct 
ccaggctgtt 
aaccagaggt 



tggtggcagg 
cctgggaggc 
aaaagcaaaa 
ctttgttgcc 
ccaaagtgct 
gttgtatttc 
gacttactgg 
tttttctttc 
ggtgcacaat 
cagccttccc 
tttagtctta 
ccttggcctc 
ttttttttta 
caagctgtca 
cctggccttc 
gtaatcccag 
accctggcta 
ggtggcaggt 
ccgggaggtg 
gtgcgagact 
tagtttctga 
cacttgtctc 
atgtatcagt 
gtgtaaattc 
aaatactagt 
atgtgcttca 
ttctttctgc 
ttcttcttcc 
gtacttttta 
gtattctcta 
tcctttattt 
tttcactctt 
gcctcctggg 
tgtgccacca 
ggccagactg 
gctgggatta 
ttgattgaaa 
ttaatttctt 
actctatagc 
ctgggttcaa 
caccactcct 
ggatggtctc 
ttacaggtgt 
tctcacccag 
gcttaagtga 
atgcctggct 
tggtcttgaa 
tacaggtgtg 
actttcttgt 
gataatgtgg 
ttgttgtgga 
ctatattctg 
gaaagagatt 
cacgcctgta 
tttaagatca 
ccgggcatgg 
gcttcaatcc 
gggcaacaga 
tccatccttt 
cacggctttt 
gaaagttgag 



106980 
107040 
107100 
107160 
107220 
107280 
107340 
107400 
107460 
107520 
107580 
107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
108120 
108180 
108240 
108300 
108360 
108420 
108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
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ggcctcctca 
tttccaagca 
gcctcttcct 
acaacttcag 
gttatcccag 
ccagcctggc 
tggtgggtgc 
gggaggcgga 
gtgagactcc 
aggctgcccc 
acactagaaa 
tctgacacca 
tggtagtagt 
cttcattgag 
ttttgacagt 
gtttactgtg 
gctgccaact 
ttttaagtta 
caatgtactt 
agtttcactc 
cgcctcccag 
acccccacca 
tcaggctggt 
tgggattaca 
gatctacttc 
acctcccttt 
ctctggcata 
gacacagctc 
tagctccatc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctctga 
ctctgacata 
gacatagctc 
tagctccatc 
ttgccatata 
tcaaaagtat 
ttttgggggc 
catgacgaaa 
ctgtagtccc 
ggaagtcaag 
ttatctcaga 
gaggctgagg 
tgaaaccccg 
atcccagcta 
cagtgagcaa 
aaaaaaaaaa 
aatcccaaag 
agcctgggca 
aaaaagtata 
ttctttcctg 
atttagaatt 
aatggtgcag 
tcagcctccc 
gtatttttag 
tcatgatctg 
gcccatttag 
aggctggagt 
cattctcctg 
ctaatttttt 



ggtgtttctg 
tgaacaggag 
tcctaggctt 
gtagtatcta 
cactttggga 
caacatggtg 
ctgtaatctc 
ggttgcagtg 
atctcgggga 
agccctgaga 
gatccaaatg 
gcaagccaca 
ggctgagcag 
cagtcatctg 
ttaatagaaa 
acagttttga 
ccatcatttt 
ttttcttagt 
cagattatta 
tgtcacccag 
gttcacgcca 
cacccggcta 
ctcgaactgc 
ggtgtgagcg 
cgttgtacta 
tcctctgaca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctatgaca 
atgacacagc 
catagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
tacatttatg 
tttgaaagaa 
tgaggtgggc 
cccatctcca 
agctatttgg 
gctgcagtga 
agtaaaggga 
tgagtggatc 
tctctactaa 
ctcgggaggc 
gatcgcacca 
aaaaagaaaa 
ccttgggagg 
acatagcgag 
cttgtttttt 
tgaatttgag 
tttttttttt 
tctcagatca 
aagtagctga 
tagagacagg 
cctgtcctgg 
aatttctttt 
gcagtggcac 
cctcagcctc 
gtatttttag 



agcccctgtc 
ctttccaaag 
ttctgtcagc 
cctgtaaatg 
ggccgaggcg 
aagccccgtc 
agctactcgg 
agctgagatc 
aaaaaaaaaa 
gagttcgagg 
cataagcagg 
tcagaaatac 
aaatagccca 
gttcgtagac 
aaagtttatt 
cagcagaata 
ggtgatatcc 
ggttgctcta 
ctaacttaac 
gctggagtgc 
ttctcctgcc 
attttgtatt 
tgacctcagg 
actgcactga 
tgacatagtt 
tagttccatc 
cccttctcct 
tctcctatga 
ctatgtcata 
ggcatagctc 
cagctccatc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctctga 
ctctgacata 
tatgttataa 
gttgcctgga 
agatcgcctg 
ccaaaattac 
gggaggatcc 
gctgagattg 
ctaggaatgg 
accggaggtc 
aaatacaaaa 
tgaggcaaga 
ctgcattaca 
gaaaagaaat 
ccaaggcagg 
actccatctc 
tgtatcatcc 
ttactgtcta 
tttgagacaa 
ctgcaacctc 
gactgcaggc 
gtttcactat 
cctcccaaaa 
tttttttttt 
gatctcggct 
ccgagtagct 
gagagatggg 



tagccccagc 
cccttagacc 
tctttgccca 
ccttcaggcc 
ggtgaattgc 
tctagtaaaa 
gaggctgaag 
gtgccattgc 
aaaaaaatgc 
gggtgaaaca 
attccttgag 
aggttgcctt 
gctgtcctcc 
accagagtta 
gtgacagttt 
gttgctttgc 
agctctgttg 
gagatgacaa 
acttaaagta 
aatggtgtga 
tcagcctcct 
ttta'gtagag 
tgatccgccc 
gcctaagtat 
ccccctccat 
ctccctcctc 
atgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacatagc 
catagttcca 
gctccatccc 
cttcagctct 
ggcagtggct 
agccagggag 
aaaaaattag 
cagctaaggt 
tgccactgca 
tggcttttat 
aggagtttaa 
agtagccggg 
gaatcgcttg 
gcctagatga 
taagatctag 
aggatcactt 
tatttaaaaa 
atattttacc 
gtgtcatttc 
agtctcactg 
tgcctccctg 
acctgccacc 
gttggccagg 
tgctgggatt 
ttttgagatg 
cactgcgagc 
gggattacag 
gtttcaccat 



tgtgtgcatg 
ttcatgtagc 
tctgttgttg 
aggcgcggtg 
ttgaggtcag 
atacaaaaat 
caggagaatt 
actccagcct 
catcaacagc 
aaggcaagcc 
aaaaggtctg 
ccccatggct 
tgaaatttag 
cagaaaagtt 
tgacagctga 
tggagagacg 
ctgaattttt 
tgtgcatctt 
cagcattttt 
tctcggctca 
gagtagctgg 
atgaggtttc 
atcttggcct 
ggcaacgtgt 
tttcctatag 
ctatgacgtc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctatga 
ctatgacaca 
ggcatagctc 
tagctccatc 
tccatcctcc 
tcctcccttg 
ctcttctcct 
tcagcgttat 
tatgccttta 
ttggagacca 
tctggcatgg 
gggaggatca 
ctccagcctg 
ctctaatccc 
gaccagcctg 
tgtggtggtg 
aacctgggaa 
cagagcgaga 
acactgtggt 
gaggccagga 
agaaagaaat 
tttctttttt 
cttttagtct 
tgttgcccag 
gttagagtga 
acccccagcc 
ctggtctcga 
accatgagcc 
gggtctcgct 
tccgcctccc 
gcgcctgcca 
gttagccagg 



gccttctgga 
tcttttccca 
tccctccccc 
gctcatacct 
gagttcgaga 
tagctgggcg 
gcttgagcct 
gggcgacaga 
acgaccctgg 
cttcagggag 
tatcatccct 
acatgtgagc 
cagggtctta 
tattgggagg 
atagaaaaaa 
gatctttgga 
agctatgctg 
taacttacca 
ttttttatgg 
ctgcaacctc 
gactacaggc 
accatgttgg 
cccaaagtgc 
ctataacata 
cacagtccca 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
tcatgtatta 
aattattgct 
actccagcac 
gcctgggcaa 
tggcacgcgc 
cttgagcctg 
ggtgcagatc 
agcactttgg 
gccaacatgg 
ggtgtctgta 
gcggaggttg 
ctctgcctaa 
tcatgcctgt 
gttcaacacc 
tcaaagagaa 
ttgccccttt 
gaagaacttc 
gctggagtgc 
ttttcctgcc 
aatttttttg 
attcatgacc 
accacgccca 
cttgtttccc 
•gggttcacgc 
ccacgcccac 
atggtcttga 



110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
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tctcctgacc 
gccaccgcgc 
tatttgggaa 
attattaatt 
cattgtttct 
tttttctctt 
tatctaggtg 
catagattaa 
caaatttttt 
tgtgcgttct 



tcgtgatccg 
ccggctagaa 
tgtctttatt 
gatcattctt 
gatgagaagt 
gatgttttca 
tggatctctt 
tgttttgttt 
tcctgcccct 
eta 



cccgccttgg 
tttcttgtag 
tcagcttcat 
ttcagtgttt 
catctgtcaa 
agattttctc 
ggtgtgcatc 
tgttttacca 
ctctcatctc 



cctcccaaag 
gacaggcttg 
tttttgaagg 
aaaagtgtca 
attgtccctt 
tttgtctttg 
gtatttgggc 
aatttggaga 
cttttgggag 



tgctgggatt 
ctagcaacca 
atagtttagc 
tcatgctacc 
tgtacttgaa 
gectttagta 
ttcagtaagc 
gtttttactc 
taccactgca 



acaggcgtga 
attcagtgtt 
tggctataga 
ttctgggttc 
gaattatctt 
gtttgtgatg 
ctcttagatt 
atcatttcaa 
tgtatgttgg 



114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114793 



<210> SEQ ID NO:4 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 4 

cacaggttca gcatgtttgt gcgtc 25 

<210> SEQ ID NO: 5 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 5 

cacagtccct gctggcctct gtcta 25 

<210> SEQ ID NO: 6 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 6 

caggacatct ccatcaagag getge 25 

<210> SEQ ID NO: 7 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 7 

aataagaggg ggecaggate agtgc 25 

<210> SEQ ID NO: 8 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 8 

gtgaatggca tcctggagag 20 

<210> SEQ ID NO: 9 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 9 

gtctccaggc agctcaacag 20 

<210> SEQ ID NO: 10 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 10 

accctgtccc tcctgtctga 20 

<210> SEQ ID NO: 11 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 11 

agaccctaag atgttcggag 20 

<210> SEQ ID NO: 12 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 12 

gatgacctgt gtgagttgcg 20 

<210> SEQ ID NO: 13 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 13 

cgcaactcac acaggtcatc 20 
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<210> SEQ ID NO: 14 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 14 
ggagtcaggt caaaggatgc 

<210> SEQ ID NO: 15 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 15 
gcatcctttg acctgactcc 

<210> SEQ ID NO:16 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 16 
ggtctgaaac gtgatctggg 

<210> SEQ ID NO: 17 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 17 
cccagatcac gtttcagacc 

<210> SEQ ID NO: 18 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 18 
cgatgatgtg tgggttctcc 

<210> SEQ ID NO: 19 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence . 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 19 
ggagaaccca cacatcatcg 

<210> SEQ ID NO:20 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:20 
cgtgtctgag aagtccagcc 

<210> SEQ ID NO:21 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:21 
ggctggactt ctcagacacg 

<210> SEQ ID NO:22 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 22 
acagcatctt ctccacgcac 

<210> SEQ ID NO: 23 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 23 
agtcctctgg ctttgcagtg 

<210> SEQ ID NO: 24 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 24 
tgtgcgtgga gaagatgctg 
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<210> SEQ ID NO: 25 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:25 
ggctggaaag ggaagtctac 

<210> SEQ ID NO:26 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Art if icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 26 
tggttcaggt gctcttgggg 

<210> SEQ ID NO: 27 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Art if icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 27 
cgtgaagcag gagttgagcc 

<210> SEQ ID NO: 28 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 28 
atcttgctct gggtcttccc 

<210> SEQ ID NO: 29 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM :Artif icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 29 
cactgcaaag ccagaggact 

<210> SEQ ID NO:30 
<2U> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif icial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 30 

ataagcaaga cgacgacctc 20 

<210> SEQ ID NO: 31 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 31 

ctattctgtt gggtgggttc 20 

<210> SEQ ID NO: 32 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 32 

cgtgcctcct gtgcttaccc 20 

<210> SEQ ID NO: 33 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO:33 

cagaccccaa ggtagctcag 20 

<210> SEQ ID NO:34 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 
<400> SEQ ID NO: 34 

ggaagaccca gagcaagatc 20 

<210> SEQ ID NO: 35 
<211> LENGTH: 780 
<212> TYPE: PRT 

<213> ORGANISM:Dictyostelium discoideum 
<400> SEQ ID NO:35 

Met Lys Phe Tyr He Glu Asp Leu Leu Val Tyr Phe Pro Tyr Ser Tyr 

15 10 15 

He Tyr Pro Glu Gin Tyr Ser Tyr Met Val Ala Leu Lys Arg Ser Leu 
20 25 30 
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Asp Asn Gly Gly Pro Cys He Leu Glu Met Pro Ser Gly Thr Gly Lys 

35 40 45 

Thr Val Ser Leu Leu Ser Leu He Ser Ser Tyr Gin Val Lys Asn Pro 

50 55 60 

Ser He Lys Leu He Tyr Cys Ser Arg Thr Val Pro Glu He Glu Gin 
65 70 75 80 

Ala Thr Glu Glu Ala Arg Arg Val Leu Gin Tyr Arg Asn Ser Glu Met 

85 90 95 

Gly Glu Glu Ser Pro Lys Thr Leu Cys Met Ser Met Ser Ser Arg Arg 

100 105 110 

Asn Leu Cys He Gin Pro Arg Val Ser Glu Glu Arg Asp Gly Lys Val 

115 120 125 

Val Asp Ala Leu Cys Arg Glu Leu Thr Ser Ser Trp Asn Arg Glu Ser 

130 135 140 

Pro Thr Ser Glu Lys Cys Lys Phe Phe Glu Asn Phe Glu Ser Asn Gly 
145 150 155 160 

Lys Glu He Leu Leu Glu Gly Val Tyr Ser Leu Glu Asp Leu Lys Glu 

165 170 175 

Tyr Gly Leu Lys His Gin Met Cys Pro Tyr Phe Leu Ser Arg His Met 

180 185 190 

Leu Asn Phe Ala Asn He Val He Phe Ser Tyr Gin Tyr Leu Leu Asp 

195 200 205 

Pro Lys He Ala Ser Leu He Ser Ser Ser Phe Pro Ser Asn Ser He 

210 215 220 

Val Val Phe Asp Glu Ala His Asn He Asp Asn Val Cys He Asn Ala 
225 230 235 240 

Leu Ser He Asn He Asp Asn Lys Leu Leu Asp Thr Ser Ser Lys Asn 

245 250 255 

He Ala Lys lie Asn Lys Gin He Glu Asp He Lys Lys Val Asp Glu 

260 265 270 

Lys Arg Leu Lys Asp Glu Tyr Gin Arg Leu Val Asn Gly Leu Ala Arg 

275 280 285 

Ser Gly Ser Thr Arg Ala Asp Glu Thr Thr Ser Asp Pro Val Leu Pro 

290 295 300 

Asn Asp Val He Gin Glu Ala Val Pro Gly Asn He Arg Lys Pro Ser 
305 310 315 320 

He Phe He Ser Leu Leu Arg Arg Val Val Asp Tyr Leu Arg Glu Pro 

325 330 335 

Asp Lys Ser Arg Leu Lys Ser Gin Met Leu Leu Ser Glu Ser Pro Leu 

340 345 350 

Ala Phe Leu Gin Gly Leu Tyr His Ala Thr Gin He Ser Ser Arg Thr 

355 360 365 

Leu Arg Phe Cys Ser Ser Arg Leu Ser Ser Leu Leu Arg Thr Leu Arg 

370 375 380 

He Asn Asp Val Asn Gin Phe Ser Gly He Ser Leu He Ala Asp Phe 
385 390 395 400 

Ala Thr Leu Val Gly Thr Tyr Asn Asn Gly Phe Leu He He He Glu 

405 410 415 

Pro Tyr Tyr Gin Arg Gin Asn Asn Thr Tyr Asp Gin He Phe Gin Phe 

420 425 430 

Cys Cys Leu Asp Ala Ser He Gly Met Lys Pro He Phe Asp Lys Tyr 

435 440 445 

Arg Ser Val Val lie Thr Ser Gly Thr Leu Ser Pro Leu Asp He Tyr 

450 455 460 

Thr Lys Met Leu Asn Phe Arg Pro Thr Val Val Glu Arg Leu Thr Met 
465 470 475 480 

Ser Leu Asn Arg Asn Cys lie Cys Pro Cys He Leu Thr Arg Gly Ser 

485 490 495 

Asp Gin He Ser lie Ser Thr Lys Phe Asp Val Arg Ser Asp Thr Ala 
500 505 510 
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730 






735 




Ser 


Leu 


Glu 


His 


Val 


Glu 


Lys 


Gin 


Ser Thr Ser Lys 


Pro 


Pro 


Gin 


Gin 








740 










745 




750 






Gin 


Asn 


Ser 


Ala 


lie 


Asn 


Ser 


Thr 


He Thr Thr Ser 


Thr 


Thr 


Thr 


Thr 






755 










760 




765 








Thr 


Thr 


Thr 


Ser 


Thr 


He 


Ser 


Glu 


Thr His Leu Thr 












770 










775 




780 











<210> SEQ ID NO: 36 

<211> LENGTH: 778 

<212> TYPE: PRT 

<213> ORGANISM: S. cerevisiae 



<400> SEQ ID NO: 36 
Met Lys Phe Tyr He 

1 5 
He Tyr Pro Glu Gin 
20 

Asp Val Gly Gly Asn 
35 

Thr Val Ser Leu Leu 
50 

Glu His Arg Lys He 
65 

Lys Ala Leu Val Glu 
85 

Leu Gly Tyr Gin Glu 
100 

Asn Leu Cys Leu His 
115 

Val Asp Glu Lys Cys 
130 

Leu Glu Glu Asp Pro 



Asp Asp Leu Pro Val Leu 
10 

Tyr Asn Tyr Met Cys Asp 
25 

Ser He Leu Glu Met Pro 
40 

Ser Leu Thr He Ala Tyr 
55 

He Tyr Cys Ser Arg Thr 
70 75 
Leu Glu Asn Leu Met Asp 
90 

Asp Phe Arg Gly Leu Gly 
105 

Pro Glu Val Ser Lys Glu 
120 

Arg Arg Met Thr Asn Gly 
135 

Glu Ala Asn Val Glu Leu 



Phe Pro Tyr Pro Lys 
15 

He Lys Lys Thr Leu 
30 

Ser Gly Thr Gly Lys 
45 

Gin Met His Tyr Pro 
60 

Met Ser Glu He Glu 
80 

Tyr Arg Thr Lys Glu 
95 

Leu Thr Ser Arg Lys 
110 

Arg Lys Gly Thr Val 
125 

Gin Ala Lys Arg Lys 
140 

Cys Glu Tyr His Glu 
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145 150 155 160 



Sen 


T.c*i i 


Tyr Asn 


lie 


Glu 


Vai 


Glu 


Asp 


Tvr 


Leu 


Pro 


Lys 


Glv 


Vai 


Phe 








165 










170 










175 




OCl 


Irilc 


Glu Lys 


Leu 




Lys 


±yi 


Cys 


Glu 


Glu 


Lys 


Thr 


Leu 


Cys 


Pro 






180 










185 










190 






ryr 


rile 


He Vai Arg 


A TTT 


Met 


He 


Ser 


Leu 




Asn 


lie 


He 


He 


Tvr 






195 








200 










205 








Ser 


Tyr 


His "Tyr 


Leu 


T 

lieu 


A cn 


Pro 


Lys 


lie 


Ala 


Glu 


Arg 


Vai 


Ser 


Asn 




z iu 








91 S 










220 










pi ii 

VjlU 


VaJ. 


Ser Lys 


Asp 


Del 


lie 




Tl» 

1 1C 


Phe 


Aon 


Glu 


Al a 


His 


Asn 


He 










97 n 










"J J 










240 


Asp 


Asn. 


Vai Cys 


lie 


pi ii 


Cor 
OC1 


Jueu 




T.oi i 
Jjcu 


Asp 


T.on 


Thr 


Thr 


Asp 


Ala 








245 










9 ^0 

c. J U 










« jj 




Leu 


Arg 


Arg Ala 


Thr 


Arg 


Vjiy 


Al a 


Asn 


Al a 

Aid 


T.on 




P.1 u 


Arg 


He 


Ser 






260 










9fiR 

60 J 










97n 








vai 


Arg Lys 


Vai 


Asp 


Ser 


pi n 
ijin 


Lys 


Leu 


PI n 


Asp 


Pin 


Tyr 


P.! ii 


Lys 






275 


















9ft 5 








Leu 


vai 


Gin Gly Leu 


nib 


Cor 


Ala 
nla 


Asp 


Tl p» 
lie 


L6U 


i in 


A cn 

AO Jul 


P.1 n 

ulll 


V3lli 


Glu 






















inn 










Pro 


Phe 


Vai Glu 


Thr 


Pro 


vai 


Leu 


Pro 


Pi n 
Vain 


Asp 


Leu 


Leu 


l 111 


Pi 11 


Al a 
Mia 










7 1 n 

jlU 










**1 ^ 

J 10 










■*9n 


Tie. 

lie 


pro 


Gly Asn 


lie 


Arg 


Arg 


Aid 


pi ii 


nis 


rne 


\/*i 
vai 


Ser 


Jrllc 


Leu 


Lys 








325 




















J J J 




Arg 


Leu 


He Glu 


Tyr 


Leu 


Lys 


Tnr 


Arg 


Mot- 
wet 


Lys 


vax 


Leu 


nib 


v^l 

vai 


Tl 0 

lie 






340 


























Ser 


L>1U 


Thr Pro 


Lys 


Ser 


rue 


Leu 


Pin 


Hi c 
nla 


Leu 


Lys 


P.! n 


Leu 


1 111 


Php 

true 






355 








■icn 


















Tic* 

lie 


blU 


Arg Lys 


Pro 


Leu 


Arg 


DVio 

iriie 


Cys 


Cor 

oei 


pi ii 

olU 


Arg 


Leu 


Cor 


Jbeu 


T.on 




j / u 








j f 0 










J O U 










vai 


Arg 


Thr Leu 


Glu 


vai 


i nr 


Pi n 
V?1U 


vai 


Pi ii 


Asp 


rile 


1 ill 


Al a 
nla 


T .01 1 




TOR 








ion 










•jqi: 












Asp 


Tie* 

ne 


Ala Thr 


Phe 


JV 1 a 
Ala 


i nr 


Leu 


Tl o 
1 lc 


Ser 


l ill 


Tyr 


Pi ii 


Pi n 


Pi vr 
uiy 


php 

£rlle 








405 










/tin 

4lU 










4lJ 




Leu 


Leu 


He He 


Glu 


Pro 


Tyr 


Pi n 


Tip 

1 lc 


pi ii 


Asn 


Ala 
Aid 


A 1 a 
rVia 


\7al 
vai 


Irl \J 








420 


























Pro 


Tic 

i ie 


Met Arg 


Phe 


1 111 


Cys 


Leu 


Asp 


Al P* 

/II Cl 


Qor 
Del 


Tip 
lie 


Al A 
nla 


Tl o 
lie 


T ,VQ 


Pro 






435 








4 a n 


















vai 


irfie 


Glu Arg 


Phe 




Ser 


vai 


lie 


Tie* 
lie 


l in 


Cor" 


P.1 V 


nil 


Tip 
lie 


Ser 




fiDU 








4 J J 










dfin 

ftOU 










Pro 


Leu 


Asp Met 


Tyr 


Pro 


Arg 


Met- 
ric L. 


Leu 


Asn 


Pho 
flic 




i in 


Vai 


Leu 


Gin 




















475 










480 


Lys 


Cor 


Tyr Ala Met 


Thr 


Leu 


Ala 


Lys 


Lys 


Ser 


Phe 


Leu 


Pro 


Met 


He 








485 










490 










495 




Tl P. 
lie 


l ill. 


Lys Gly Ser 


Asp 


Gin 


Vai 


Ala 


He 


Ser 


Ser 


Arg 


Phe 


Glu 


He 






500 










505 










510 






r\i y 


Asn 


Asp Pro 


Ser 


He 


Vai 


Arg 


Asn 


Tvr 


Glv 


Ser 


Met 


Leu 


Vai 


Glu 






515 








520 










525 








Phe 


Ala 


Lys He 


Thr 


Pro 


Asp 


Glv 


Met 


Vai 


Vai 


Phe 


Phe 


Pro 


Ser 


Tvr 




sin 








535 










540 










Leu 


Tyr 


Met Glu 


Ser 


He 


Vai 


Ser 


Met 


Trp 


Gin 


Thr 


Met 


Gly 


lie 


Leu 


545 








550 










555 










560 


Asp 


Glu 


Vai Trp Lys 


His 


Lys 


Leu 


He 


Leu 


Vai 


Glu 


Thr 


Pro 


Asp 


Ala 








565 










570 










575 




Gin 


Glu 


Thr Ser 


Leu 


Ala 


Leu 


Glu 


Thr 


Tyr 


Arg 


Lys 


Ala 


Cys 


Ser 


Asn 






580 










585 










590 






Gly 


Arg 


Gly Ala 


lie 


Leu 


Leu 


Ser 


Vai 


Ala 


Arg 


Gly 


Lys 


Vai 


Ser 


Glu 






595 








600 










605 








Gly 


He 


Asp Phe Asp 


His 


Gin 


Tyr 


Gly 


Arg 


Thr 


Vai 


Leu 


Met 


He 


Gly 




610 








615 










620 
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lie Pro 
625 

Phe Met 

Asp Ala 

Lys Asp 

Lys Arg 
690 
Asp Leu 
705 

Leu Arg 

Ser Val 

Asp Gin 

Glu Asp 
770 



Phe Gin 

Arg Glu 

Met Arg 
660 
Asp Tyr 
675 

Ser Gin 

Asn Leu 

Thr Met 

Trp Ser 
740 
Gly Gly 
755 

Glu Asp 



Tyr Thr 
630 
Asn Tyr 
645 

His Ala 

Gly Val 

Leu Pro 

Ser Thr 
710 
Ala Gin 
725 

Tyr Glu 
Phe He 
Glu Asp 



Glu Ser Arg 

Arg He Arg 

Ala Gin Cys 
665 

Met Val Leu 

680 
Lys Trp He 
695 

Asp Met Ala 

Pro Thr Asp 

Asp Leu lie 
745 

Glu Asn Glu 

760 
He Glu Met 
775 



lie Leu 
635 
Glu Asn 
650 

Leu Gly 

Ala Asp 

Ala Gin 

lie Ser 
715 
Pro Lys 
730 

Lys His 
Asn Lys 
Gin 



Lys Ala 

Asp Phe 

Arg Val 

Arg Arg 
685 
Gly Leu 
700 

Asn Thr 

Asp Gin 

Gin Asn 

Glu Gly 
765 



Arg Leu Glu 
640 

Leu Ser Phe 
655 

Leu Arg Gly 
670 

Phe Ser Arg 

Ser Asp Ala 

Lys Gin Phe 
720 

Glu Gly Val 

735 
Ser Arg Lys 
750 

Glu Gin Asp 



<210> SEQ ID NO: 37 
<211> LENGTH: 772 
<212> TYPE: PRT 
<213> ORGANISM: S. pombe 



<400> SEQ ID NO: 37 



Met 


Lys 


Phe 


Tyr 


lie 


Asp 


Asp 


Leu 


Pro 


lie 


Leu 


Phe 


Pro 


Tyr 


Pro 


Arg 


1 








5 










10 










15 




He 


Tyr 


Pro 


Glu 


Gin 


Tyr 


Gin 


Tyr 


Met 


Cys 


Asp 


Leu 


Lys 


His 


Ser 


Leu 








20 










25 










30 






Asp 


Ala 


Gly 


Gly 


lie 


Ala 


Leu 


Leu 


Glu 


Met 


Pro 


Ser 


Gly 


Thr 


Gly 


Lys 






35 










40 










45 








Thr 


He 


Ser 


Leu 


Leu 


Ser 


Leu 


lie 


Val 


Ser 


Tyr 


Gin 


Gin 


His 


Tyr 


Pro 




50 










55 










60 










Glu 


His 


Arg 


Lys 


Leu 


lie 


Tyr 


Cys 


Ser 


Arg 


Thr 


Met 


Ser 


Glu 


lie 


Asp 


65 










70 










75 










80 


Lys 


Ala 


Leu 


Ala 


Glu 


Leu 


Lys 


Arg 


Leu 


Met 


Ala 


Tyr 


Arg 


Thr 


Ser 


Gin 










85 










90 










95 




Leu 


Gly 


Tyr 


Glu 


Glu 


Pro 


Phe 


Leu 


Gly 


Leu 


Gly 


Leu 


Thr 


Ser 


Arg 


Lys 








100 










105 










110 






Asn 


Leu 


Cys 


Leu 


His 


Pro 


Ser 


Val 


Arg 


Arg 


Glu 


Lys 


Asn 


Gly 


Asn 


Val 






115 










120 










125 








Val 


Asp 


Ala 


Arg 


Cys 


Arg 


Ser 


Leu 


Thr 


Ala 


Gly 


Phe 


Val 


Arg 


Glu 


Gin 




130 










135 










140 










Arg 


Leu 


Ala 


Gly 


Met 


Asp 


Val 


Pro 


Thr 


Cys 


Glu 


Phe 


His 


Asp 


Asn 


Leu 


145 










150 










155 










160 


Glu 


Asp 


Leu 


Glu 


Pro 


His 


Ser 


Leu 


lie 


Ser 


Asn 


Gly 


Val 


Trp 


Thr 


Leu 










165 










170 










175 




Asp 


Asp 


He 


Thr 


Glu 


Tyr 


Gly 


Glu 


Lys 


Thr 


Thr 


Arg 


Cys 


Pro 


Tyr 


Phe 








180 










185 










190 






Thr 


Val 


Arg 


Arg 


Met 


Leu 


Pro 


Phe 


Cys 


Asn 


Val 


lie 


He 


Tyr 


Ser 


Tyr 






195 










200 










205 








His 


Tyr 


Leu 


Leu 


Asp 


Pro 


Lys 


lie 


Ala 


Glu 


Arg 


Val 


Ser 


Arg 


Glu 


Leu 




210 










215 










220 










Ser 


Lys 


Asp 


Cys 


lie 


Val 


Val 


Phe 


Asp 


Glu 


Ala 


His 


Asn 


He 


Asp 


Asn 


225 










230 










235 










240 


Val 


Cys 


lie 


Glu 


Ser 


Leu 


Ser 


lie 


Asp 


Leu 


Thr 


Glu 


Ser 


Ser 


Leu 


Arg 










245 










250 










255 
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Lys Ala Ser Lys Ser lie Leu Ser Leu Glu Gin Lys Val Asn Glu Val 

260 265 270 

Lys Gin Ser Asp Ser Lys Lys Leu Gin Asp Glu Tyr Gin Lys Leu Val 

275 280 285 

Arg Gly Leu Gin Asp Ala Asn Ala Ala Asn Asp Glu Asp Gin Phe Met 

290 295 300 

Ala Asn Pro Val Leu Pro Glu Asp Val Leu Lys Glu Ala Val Pro Gly 
305 310 315 320 

Asn lie Arg Arg Ala Glu His Phe lie Ala Phe Leu Lys Arg Phe Val 

325 330 335 

Glu Tyr Leu Lys Thr Arg Met Lys Val Leu His Val He Ala Glu Thr 

340 345 350 

Pro Thr Ser Phe Leu Gin His Val Lys Asp He Thr Phe He Asp Lys 

355 360 365 

Lys Pro Leu Arg Phe Cys Ala Glu Arg Leu Thr Ser Leu Val Arg Ala 

370 375 380 

Leu Gin He Ser Leu Val Glu Asp Phe His Ser Leu Gin Gin Val Val 
385 390 395 400 

Ala Phe Ala Thr Leu Val Ala Thr Tyr Glu Arg Gly Phe He Leu He 

405 410 415 

Leu Glu Pro Phe Glu Thr Glu Asn Ala Thr Val Pro Asn Pro He Leu 

420 425 430 

Arg Phe Ser Cys Leu Asp Ala Ser He Ala He Lys Pro Val Phe Glu 

435 440 445 

Arg Phe Arg Ser Val He He Thr Ser Gly Thr Leu Ser Pro Leu Asp 

450 455 460 

Met Tyr Pro Lys Met Leu Gin Phe Asn Thr Val Met Gin Glu Ser Tyr 
465 470 475 480 

Gly Met Ser Leu Ala Arg Asn Cys Phe Leu Pro Met Val Val Thr Arg 

485 490 495 

Gly Ser Asp Gin Val Ala He Ser Ser Lys Phe Glu Ala Arg Asn Asp 

500 505 510 

Pro Ser Val Val Arg Asn Tyr Gly Asn He Leu Val Glu Phe Ser Lys 

515 520 525 

He Thr Pro Asp Gly Leu Val Ala Phe Phe Pro Ser Tyr Leu Tyr Leu 

530 535 540 

Glu Ser He Val Ser Ser Trp Gin Ser Met Gly He Leu Asp Glu Val 
545 550 555 560 

Trp Lys Tyr Lys Leu He Leu Val Glu Thr Pro Asp Pro His Glu Thr 

565 570 575 

Thr Leu Ala Leu Glu Thr Tyr Arg Ala Ala Cys Ser Asn Gly Arg Gly 

580 585 590 

Ala Val Leu Leu Ser Val Ala Arg Gly Lys Val Ser Glu Gly Val Asp 

595 600 605 

Phe Asp His His Tyr Gly Arg Ala Val He Met Phe Gly He Pro Tyr 

610 615 620 

Gin Tyr Thr Glu Ser Arg Val Leu Lys Ala Arg Leu Glu Phe Leu Arg 
625 630 635 640 

Asp Thr Tyr Gin He Arg Glu Ala Asp Phe Leu Thr Phe Asp Ala Met 

645 650 655 

Arg His Ala Ala Gin Cys Leu Gly Arg Val Leu Arg Gly Lys Asp Asp 

660 665 670 

His Gly He Met Val Leu Ala Asp Lys Arg Tyr Gly Arg Ser Asp Lys 

675 680 685 

Arg Thr Lys Leu Pro Lys Trp He Gin Gin Tyr He Thr Glu Gly Ala 

690 695 700 

Thr Asn Leu Ser Thr Asp Met Ser Leu Ala Leu Ala Lys Lys Phe Leu 
705 710 715 720 

Arg Thr Met Ala Gin Pro Phe Thr Ala Ser Asp Gin Glu Gly He Ser 
725 730 735 



51 



WO 01/42434 



Trp Trp Ser Leu Asp Asp Leu Leu lie 
740 745 
Ser Ala Ala He Glu Gin Ser Lys His 

755 760 
Val Val Glu Thr 
770 

<210> SEQ ID NO: 38 

<211> LENGTH: 760 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 



<400> SEQ ID NO:38 



Met 


Lys 


Leu 


Asn 


Val 


Asp 


biy 


Leu 


Leu 


1 








5 










He 


Tyr 


Pro 


Glu 


Gin 


Phe 


Ser 


Tyr 


Met 








20 










Id 


Asp 


Ala 


Lys 


Gly 


His Gly 


Val 


Leu 


Glu 






35 










40 




Thr 


Val 


Ser 


Leu 


Leu 


Ala 


Leu 


lie 


Met 




50 










55 






Leu 


Glu 


Val 


Thr 


Lys 


Leu 


He 


Tyr 


Cys 


65 










70 








Glu 


Lys 


Val 


He 


Glu 


Glu 


Leu 


Arg 


Lys 










85 










Gin 


Glu 


Gly 


Glu 


Lys 


Leu 


Pro 


Pne 


Leu 








100 










1 AC 

IUd 


Lys 


Asn 


Leu 


Cys 


He 


His 


Pro 


Glu 


Val 






lib 










ion 
l<i U 




Asp 


Val 


Asp 


biy 


Lys 


Cys 


rilS 


oer 


Leu 




1 3V 










1 1 £ 






r<~\ 

kin 


Tyr 


(jin 


HI S 


Asp 


Thr 


oer 


Leu 


Pro 


1 A G> 










150 








rile 


Asp 


Ala 


144 C 

rtXS 


Gly Arg 




val 


XT X \J 










165 










Leu 


Asp 


Asp 


Leu 


Lys 


Ala 


Leu 


Vjj.y 


Arg 








1 DA 
ItJU 










lOD 




1*€U 


Ala 


Arg 


Tyr 


Ser 


He 


Leu 


His 






195 










200 




Tyr 


His 


Tyr 


Leu 


Leu 


Asp 


Pro 


Lys 


He 




210 










215 






Leu 


Ala 


Arg 


Lys 


Ala 


Val 


Val 


Val 


Phe 


225 










230 








Asn 


Val 


Cys 


He 


Asp 


Ser 


Met 


Ser 


Val 










245 










Asp 


Arg 


Cys 


Gin 


Gly Asn 


Leu 


Glu 


Thr 








260 










265 


He 


Lys 


Glu 


Thr 


Asp 


Glu 


Gin 


Arg 


Leu 






275 










280 




Val 


Glu 


Gly 


Leu 


Arg 


Glu 


Ala 


Ser 


Ala 




290 










295 






Leu 


Ala 


Asn 


Pro 


Val 


Leu 


Pro 


Asp 


Glu 


305 










310 








Gly 


Ser 


He 


Arg 


Thr 


Ala 


Glu 


His 


Phe 










325 










Leu 


Glu 


Tyr 


Val 


Lys 


Trp 


Arg 


Leu 


Arg 








340 










345 


Ser 


Pro 


Pro 


Ala 


Phe 


Leu 


Ser 


Gly 


Leu 






355 










360 




Arg 


Lys 


Pro 


Leu 


Arg 


Phe 


Cys 


Ala 


Glu 
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His 


Gin 


Lys 


Lys 


Ala 


Leu 


Lys 










750 






Glu 


Asp 


Glu 


Met 


Asp 


He 


Asp 








765 








Val 


ryx 


Phe 


Pro 


iyx 


Asp 


iyx 


10 










15 




ft iy 


\J A. LI 


T.oi i 

IiCU 




Ayrr 
/ix y 


Thr 


Leu 
















Mot- 
net 


Pro 


ocx 


v?xy 


1 liX 


Vjxy 


Lys 








** j 








Ala 


Tyr 




Arg 


Al a 


Tyr 


Pro 






60 










Car 
OCX 


Arg 


1 XIX 


vctx 


Pro 


Glu 


He 




7S 










80 


Leu 


Leu 


Asn 


XT 11C 


iyx 


Glu 


Lys 












95 






Leu 


A"l a 


Leu 


Cor* 

OCX 


Ser 


Arg 










110 






1 I1X 


Pro 


Xit^li 


A ytt 
rtx y 


Phe 


r*l v 


xjy o 








X^ «J 








1 I1X 


Al a 


Cor 


xyr 


vox 


Attt 


Al a 

Ala 
















nis 


uys 


Arg 


IrlJt; 


iyx 


oxii 


UlU 




155 










160 


L6U 


Pro 


Ala 


Gly 


He 


iyx 


Asn 


170 










175 




Arg 


n n 

V?XI1 




irp 




r X vJ 


iyx 










±J\J 






TV "I a 

j-\±cx 


Asn 


Val 


Val 


Val 


iyx 


Ser 








205 








Til a 


Asp 


lieu 


Val 
val 


Cor* 

ocx 


Lys 


Glu 






220 










A en 




Al a 


His 


Asn 


He 


Asp 




£t J J 










240 


Asn 


Leu 


Thr 


Arg 


Arg 


Thr 


Leu 


250 










255 




Leu 


Gin 


Lys 


Thr 


Val 


Leu 


Arg 










270 






Arcr 


Asp 


Glu 


Tvr 

JT 


Arg 


Arg 


Leu 








285 








Ala 


Arg 


Glu 


Thr 


Asp 


Ala 


His 






300 










Val 


Leu 


Gin 


Glu 


Ala 


Val 


Pro 




315 










320 


Leu 


Gly 


Phe 


Leu 


Arg 


Arg 


Leu 


330 










335 




Val 


Gin 


His 


Val 


Val 


Gin 


Glu 










350 






Ala 


Gin 


Arg 


Val 


Cys 


He 


Gin 








365 








Arg 


Leu 


Arg 


Ser 


Leu 


Leu 


His 
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370 



375 



380 



1 III. 


Leu 


pi i) 


He 


Thr 


Asp 


Leu Ala Asp Phe Ser 


Pro 


Leu 


Thr 


Leu 


Leu 












390 


395 










400 


Hid 


Asn 


f ne 


Ala 


Thr 


Leu 


Val Ser Thr Tyr Ala 




\jiy 


Phe 


Thr 


He 










405 




410 








415 




Tin 

lie 


j. J.e 


pi ii 

G1U 


Pro 


Phe 


Asp 


Asp Arg Thr Pro Thr 


TIP 
lie 


Ala 


Asn 


Pro 


He 








420 






425 






430 






L6U 


HIS 


rile 


Ser Cys 


Met Asp Ala Ser Leu Ala 


Tip 

1 1C 


Lys 


Pro 


Val 


Phe 














440 




445 










Arg 


irne 


Gin 


ber 


Val 


He He Thr Ser Gly 


Thr 


Leu 


Ser 


Pro 


Leu 




A C ft 

4bU 










455 












Asp 


lie 


Tyr 


Pro 


Lys 


He 


Leu Asp Phe His Pro 


Val 


1 ill 




Ala 


1 LIS. 


A C C 










470 


475 










low 


Pne 


Thr 


Met 


Thr 


Leu 


Ala 


Arg Val Cys Leu Cys 


Pro 


Met 


He 


Tlo 
lie 


oiy 










/toe 
485 




490 












Arg 


Gly 


Asn 


Asp 


Gin 


Val 


Ala He Ser Ser Lys 


Phe 


Glu 


Thr 


Arg 
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